RECOMBINANT BONE MORPHOGENETIC PROTEIN HETERODIMERS,
COMPOSITIONS AND METHODS OF USE 

Field of the Invention 
The present invention relates to a series of
novel recombinant heterodimeric proteins useful in the
field of treating bone defects, healing bone injury and
in wound healing in general. The invention also relates
to methods for obtaining these heterodimers, methods for
producing them by recombinant genetic engineering
techniques, and compositions containing them.

Background of the Invention 

In recent years, protein factors which are
characterized by bone or cartilage growth inducing
properties have been isolated and identified. See, e.g.,
U.S. Patent No. 5,013,649; PCT published application
WO90/11366; PCT published application WO91/05802 and the
variety of references cited therein. See, also,
PCT/US90/05903, which discloses a protein sequence termed
OP-1, which is substantially similar to human BMP-7 and
has been reported to have osteogenic activity.

A family of individual bone morphogenetic
proteins (BMPs), termed BMP-2 through BMP-9, has been
isolated and identified. Incorporated by reference for
the purposes of providing disclosure of these proteins
and methods of producing them are co-owned, co-pending
U.S. Patent Application SN 721,847 and the related
applications recited in its preamble. Of particular
interest are the proteins termed BMP-2 and BMP-4,
disclosed in the above-referenced application; BMP-7,
disclosed in SN 438,919; BMP-5, disclosed in SN 370,547
and SN 356,033; BMP-6, disclosed in SN 370,544 and SN
347,559; and BMP-8, disclosed in SN 525,357. Additional
members of the BMP family include BMP-1, disclosed in SN
655,578; BMP-9, disclosed in SN 720,590; and BMP-3,
disclosed in SN 179,197 and PCT publication 89/01464.
These applications are incorporated herein by reference
for disclosure of these BMPs.

There remains a need in the art for other
proteins and compositions useful in the fields of bone
and wound healing.

Summary of the Invention

In one aspect, the invention provides a method
for producing a recombinant heterodimeric protein having
bone stimulating activity comprising culturing a selected
host cell containing a polynucleotide sequence encoding a
first selected BMP or fragment thereof and a
polynucleotide sequence encoding a second selected BMP or
fragment thereof. The resulting co-expressed,
biologically active heterodimer is isolated from the
culture medium.

According to one embodiment of this invention,
the host cell may be co-transfected with one or more
vectors containing coding sequences for one or more BMPs.
Each BMP polynucleotide sequence may be present on the
same vector or on individual vectors transfected into the
cell. Alternatively, the BMPs or their fragments may be
incorporated into a chromosome of the host cell.
Additionally, a single transcription unit may encode a
single copy of each of two genes, each encoding a
different BMP.

According to another embodiment of this
invention, the selected host cell containing the two
polypeptide encoding sequences is a hybrid cell line
obtained by fusing two selected, stable host cells, each
host cell transfected with, and capable of stably
expressing, a polynucleotide sequence encoding a selected
first or second BMP or fragment thereof.

In another aspect of the present invention,
therefore, there are provided recombinant heterodimeric
proteins comprising a protein or fragment of a first BMP
in association with a protein or fragment of a second
BMP. The heterodimer may be characterized by bone
stimulating activity. The heterodimers may comprise a
protein or fragment of BMP-2 associated with a protein or
fragment of either BMP-5, BMP-6, BMP-7 or BMP-8; or a
protein or fragment of BMP-4 associated with a protein or
fragment of either BMP-5, BMP-6, BMP-7 or BMP-8. In
further embodiments the heterodimers may comprise a
protein or fragment of BMP-2 associated with a protein or
fragment of either BMP-1, BMP-3 or BMP-4. BMP-4 may also
form a heterodimer in association with BMP-1, BMP-2 or a
fragment thereof. Still further embodiments may comprise
heterodimers involving combinations of BMP-5, BMP-6,
BMP-7 and BMP-8. For example, the heterodimers may comprise
BMP-5 associated with BMP-6, BMP-7 or BMP-8; BMP-6
associated with BMP-7 or BMP-8; or BMP-7 associated with
BMP-8. These heterodimers may be produced by
co-expressing each protein in a selected host cell and
isolating the heterodimer from the culture medium.
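
As an illustrative aid only, the pairings recited in the preceding paragraph can be tallied programmatically. The following Python sketch is not part of the disclosed methods: the variable and function names are hypothetical, and the pair lists simply transcribe the combinations stated above. Treating a pair as unordered (so that BMP-4 with BMP-2 counts once with BMP-2 with BMP-4) is an assumption made for counting purposes.

```python
from itertools import combinations

# Pairings transcribed from the summary paragraph above (not derived from data).
# BMP-2 or BMP-4, each with BMP-5, BMP-6, BMP-7 or BMP-8.
primary = [(a, b)
           for a in ("BMP-2", "BMP-4")
           for b in ("BMP-5", "BMP-6", "BMP-7", "BMP-8")]

# Further embodiments: BMP-2 with BMP-1, BMP-3 or BMP-4; BMP-4 with BMP-1 or BMP-2.
further = [("BMP-2", "BMP-1"), ("BMP-2", "BMP-3"), ("BMP-2", "BMP-4"),
           ("BMP-4", "BMP-1"), ("BMP-4", "BMP-2")]

# Combinations among BMP-5, BMP-6, BMP-7 and BMP-8.
among_5_to_8 = list(combinations(("BMP-5", "BMP-6", "BMP-7", "BMP-8"), 2))


def unique_pairs(*groups):
    """Return unordered pairs, each listed once, lower BMP number first."""
    seen = set()
    out = []
    for group in groups:
        for a, b in group:
            key = frozenset((a, b))
            if key in seen:
                continue  # e.g. BMP-4/BMP-2 repeats BMP-2/BMP-4
            seen.add(key)
            out.append(tuple(sorted((a, b), key=lambda n: int(n.split("-")[1]))))
    return out


all_pairs = unique_pairs(primary, further, among_5_to_8)
for pair in all_pairs:
    print("/".join(pair))
```

Under these assumptions the summary recites eighteen distinct unordered pairings; the count says nothing about which pairings are preferred or most active.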

As a further aspect of this invention a cell
line is provided which comprises a first polynucleotide
sequence encoding a first BMP or fragment thereof and a
second polynucleotide sequence encoding a second BMP or
fragment thereof, the sequences being under control of
one or more suitable expression regulatory systems
capable of co-expressing the BMPs as a heterodimer. The
cell line may be transfected with one or more than one
polynucleotide molecule. Alternatively, the cell line
may be a hybrid cell line created by cell fusion as
described above.

Another aspect of the invention is a
polynucleotide molecule or plasmid vector comprising a
polynucleotide sequence encoding a first selected BMP or
fragment thereof and a polynucleotide sequence encoding a
second selected BMP or fragment thereof. The sequences
are under the control of at least one suitable regulatory
sequence capable of directing co-expression of each
protein or fragment. The molecule may contain a single
transcription unit containing a copy of both genes, or
more than one transcription unit, each containing a copy
of a single gene.

As still another aspect of this invention there
is provided a method for producing a recombinant dimeric
or heterodimeric protein having bone stimulating activity
in a prokaryotic cell comprising culturing a selected
host cell containing a polynucleotide sequence encoding a
first selected BMP or fragment thereof; culturing a
second selected host cell containing a polynucleotide
sequence encoding a second selected BMP or fragment
thereof; isolating monomeric forms of each BMP protein
from the culture medium and co-assembling a monomer of
the first protein with a monomer of the second protein.
The first protein and the second protein may be the same
or different BMPs. The resulting biologically active
dimer or heterodimer is thereafter isolated from the
mixture. Preferred cells are E. coli.

Thus, as further aspects of this invention
recombinant BMP dimers or heterodimers produced in
eukaryotic cells are provided, as well as suitable
vectors or plasmids, and selected transformed cells
useful in such a production method.

Other aspects and advantages of the present 
invention are described further in the following detailed 
description of preferred embodiments of the present
invention. 

Brief Description of the Figures

Figure 1 provides the DNA and amino acid
sequences of human BMP-2 (SEQ ID NOs: 1 and 2).

Figure 2 provides the DNA and amino acid
sequences of human BMP-4 (SEQ ID NOs: 3 and 4).

Figure 3 provides the DNA and amino acid
sequences of human BMP-7 (SEQ ID NOs: 5 and 6).

Figure 4 provides the DNA and amino acid
sequences of human BMP-6 (SEQ ID NOs: 7 and 8).

Figure 5 provides the DNA and amino acid
sequences of human BMP-5 (SEQ ID NOs: 9 and 10).

Figure 6 provides the DNA and amino acid
sequences of human BMP-8 (SEQ ID NOs: 11 and 12).

Figure 7 provides the DNA sequence of vector
PALB2-781 containing the mature portion of the BMP-2 gene
(SEQ ID NOs: 13 and 14).

Figure 8 compares the activity of CHO BMP-2 and
CHO BMP-2/7 in the W20 alkaline phosphatase assay.

Figure 9 compares the activity of CHO BMP-2 and
CHO BMP-2/7 in the BGP (osteocalcin) assay.

Figure 10 provides a comparison of the W-20
activity of E. coli produced BMP-2 and BMP-2/7
heterodimer.

Figure 11 depicts BMP-3 DNA and amino acid sequence.

Figure 12 provides a comparison of BMP-2 and BMP-2/6
in the W-20 assay.

Figure 13 provides a comparison of the in vivo
activity of BMP-2/6 and BMP-2.

Figure 14 provides a comparison of BMP-2, BMP-6 and
BMP-2/6 in vivo activity.



Detailed Description of the Invention 

The present invention provides a method for
producing recombinant heterodimeric proteins having bone
stimulating activity, as well as the recombinant
heterodimers themselves, and compositions containing them
for bone-stimulating or repairing therapeutic use.

As used throughout this document, the term
'heterodimer' is defined as a biologically-active protein
construct comprising the association of two different BMP
protein monomers or active fragments thereof joined
through at least one covalent, disulfide linkage. A
heterodimer of this invention may be characterized by the
presence of from one to seven disulfide linkages
between the two BMP component strands.

According to the present invention, therefore,
a method for producing a recombinant BMP heterodimer
comprises culturing a
selected host cell containing a polynucleotide sequence
encoding a first selected BMP or a biologically active
fragment thereof and a polynucleotide sequence encoding a
second selected BMP or a fragment thereof. The resulting
co-expressed, biologically active heterodimer is formed
within the host cell, secreted therefrom and isolated
from the culture medium. Preferred embodiments of
methods for producing the heterodimeric proteins of this
invention are described in detail below and in the
following examples. Preferred methods of the invention
involve known recombinant genetic engineering techniques
[See, e.g., Sambrook et al, "Molecular Cloning: A
Laboratory Manual", 2d edition, Cold Spring Harbor
Laboratory, Cold Spring Harbor, NY (1989)]. However,
other methods, such as conventional chemical synthesis,
may also be useful in preparing a heterodimer of this
invention.

BMP heterodimers generated by this method are
produced in a mixture of homodimers and heterodimers.
This mixture of heterodimers and homodimers may be
separated from contaminants in the culture medium by
resort to essentially conventional methods, such as
classical protein biochemistry or affinity antibody
columns specific for one of the BMPs making up the
heterodimer. Additionally, if desired, the heterodimers
may be separated from homodimers in the mixture. Such
separation techniques allow unambiguous determination of
the activity of the heterodimeric species. Example 4
provides one presently employed purification scheme for
this purpose.

Preferably the recombinant heterodimers of this
invention produced by these methods involve the BMPs
designated human BMP-2, human BMP-4, human BMP-5, human
BMP-6, human BMP-7 and BMP-8. However, BMP-3 has also
been determined to form an active heterodimer with BMP-2.
Other species of these BMPs as well as BMPs other than those
specifically identified above may also be employed in
heterodimers useful for veterinary, diagnostic or
research use. However, the human proteins, specifically
those proteins identified below, are preferred for human
pharmaceutical uses.

Human BMP-2 is characterized by containing
substantially the entire sequence, or fragments, of the
amino acid sequence and DNA sequence disclosed in Figure
1. Human BMP-2 proteins are further characterized as
disulfide-linked dimers and homodimers of mature BMP-2
subunits. Recombinantly-expressed BMP-2 subunits include
protein species having heterogeneous amino termini. One
BMP-2 subunit is characterized by comprising amino acid
#249 (Ser) - #396 (Arg) of Figure 1 (SEQ ID NOs: 1 and
2). Another BMP-2 subunit is characterized by comprising
amino acid #266 (Thr) - #396 (Arg) of Figure 1. Another
BMP-2 subunit is characterized by comprising amino acid
#296 (Cys) - #396 (Arg) of Figure 1. A mature BMP-2
subunit is characterized by comprising amino acid #283
(Gln) - #396 (Arg) of Figure 1. This latter subunit is
the presently most abundant protein species which results
from recombinant expression of BMP-2 (Figure 1).
However, the proportions of certain species of BMP-2
produced may be altered by manipulating the culture
conditions. BMP-2 may also include modifications of the
sequences of Figure 1, e.g., deletion of amino acids
#241-280 and changing amino acid #245 Arg to Ile, among
other changes.
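
For orientation, the residue ranges recited above for the BMP-2 subunits imply the subunit lengths computed in the following Python sketch. This is simple arithmetic on the Figure 1 numbering, assuming each range is inclusive of both end residues; the dictionary keys and the function name are hypothetical labels, and the results are residue counts only, not measured molecular weights.

```python
# Residue ranges (Figure 1 numbering) for the BMP-2 subunits named in the text.
# Ranges are assumed to be inclusive of both the first and last residue.
bmp2_subunits = {
    "Ser249-Arg396": (249, 396),
    "Thr266-Arg396": (266, 396),
    "Gln283-Arg396": (283, 396),  # described in the text as the most abundant species
    "Cys296-Arg396": (296, 396),
}


def subunit_length(first_residue, last_residue):
    """Number of residues in an inclusive range."""
    return last_residue - first_residue + 1


lengths = {name: subunit_length(*bounds) for name, bounds in bmp2_subunits.items()}
for name, n in lengths.items():
    print(f"{name}: {n} residues")
```

The same calculation applies to the mature subunit ranges given for the other BMPs in this description.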

As described in detail in United States Patent
Application SN 721,847, incorporated by reference herein,
human BMP-2 may be produced by culturing a cell
transformed with a DNA sequence comprising the nucleotide
coding sequence from nucleotide #356 to #1543 in Figure 1
and recovering and purifying from the culture medium one
or more of the above-identified protein species,
substantially free from other proteinaceous materials
with which it is co-produced. Human BMP-2 proteins are
characterized by the ability to induce bone formation.
Human BMP-2 also has in vitro activity in the W20
bioassay. Human BMP-2 is further characterized by the
ability to induce cartilage formation. Human BMP-2 may
be further characterized by the ability to demonstrate
cartilage and/or bone formation activity in the rat bone
formation assay described in the above-referenced
application.

Human BMP-4 is characterized by containing
substantially the entire sequence, or fragments, of the
amino acid sequence and DNA sequence disclosed in Figure
2 (SEQ ID NOs: 3 and 4). Human BMP-4 proteins are
further characterized as disulfide-linked dimers and
homodimers of mature BMP-4 subunits.
Recombinantly-expressed BMP-4 subunits may include protein species
having heterogeneous amino termini. A mature subunit of
human BMP-4 is characterized by an amino acid sequence
comprising amino acids #293 (Ser) - #408 (Arg) of Figure
2. Other amino termini of BMP-4 may be selected from the
sequence of Figure 2. Modified versions of BMP-4,
including proteins further truncated at the amino or
carboxy termini, may also be constructed by resort to
conventional mutagenic techniques.

As disclosed in above-incorporated patent
application SN 721,847, BMP-4 may be produced by
culturing a cell transformed with a DNA sequence
comprising the nucleotide coding sequence from nucleotide
#403 to nucleotide #1626 in Figure 2 and recovering and
purifying from the culture medium a protein containing
the amino acid sequence from amino acid #293 to #408 as
shown in Figure 2, substantially free from other
proteinaceous materials with which it is co-produced.
BMP-4 proteins are capable of inducing the formation of
bone. BMP-4 proteins are capable of inducing formation
of cartilage. BMP-4 proteins are further characterized
by the ability to demonstrate cartilage and/or bone
formation activity in the rat bone formation assay.

Human BMP-7 is characterized by containing
substantially the entire sequence, or fragments, of the
amino acid sequence and DNA sequence disclosed in Figure
3. Human BMP-7 proteins are further characterized as
disulfide-linked dimers and homodimers of mature BMP-7
subunits. Recombinantly-expressed BMP-7 subunits include
protein species having heterogeneous amino termini. One
BMP-7 subunit is characterized by comprising amino acid
#293 (Ser) - #431 (His) of Figure 3 (SEQ ID NOs: 5 and
6). This subunit is the most abundantly formed protein
produced by recombinant expression of the BMP-7 sequence.
Another BMP-7 subunit is characterized by comprising
amino acids #300 (Ser) - #431 (His) of Figure 3. Still
another BMP-7 subunit is characterized by comprising
amino acids #316 (Ala) - #431 (His) of Figure 3. Other
amino termini of BMP-7 may be selected from the sequence
of Figure 3. Similarly, modified versions, including
proteins further truncated at the amino or carboxy
termini, of BMP-7 may also be constructed by resort to
conventional mutagenic techniques.

As disclosed in above-incorporated patent
application SN 438,919, BMP-7 may be produced by
culturing a cell transformed with a DNA sequence
comprising the nucleotide coding sequence from nucleotide
#97 to nucleotide #1389 in Figure 3 and recovering and
purifying from the culture medium a protein containing
the amino acid sequence from amino acid #293 to #431 as
shown in Figure 3, substantially free from other
proteinaceous or contaminating materials with which it is
co-produced. These proteins are capable of stimulating,
promoting, or otherwise inducing cartilage and/or bone
formation.

Human BMP-6 is characterized by containing
substantially the entire sequence, or fragments, of the
amino acid sequence and DNA sequence disclosed in Figure
4. Human BMP-6 proteins are further characterized as
disulfide-linked dimers of mature BMP-6 subunits.
Recombinantly-expressed BMP-6 subunits may include
protein species having heterogeneous amino termini. One
BMP-6 subunit is characterized by comprising amino acid
#375 (Ser) - #513 (His) of Figure 4 (SEQ ID NOs: 7 and
8). Other amino termini of BMP-6 may be selected from
the sequence of Figure 4. Modified versions, including
proteins further truncated at the amino or carboxy
termini, of BMP-6 may also be constructed by resort to
conventional mutagenic techniques.

As described in detail in United States Patent
Application SN 490,033, incorporated by reference herein,
human BMP-6 may be produced by culturing a cell
transformed with a DNA sequence comprising the nucleotide
coding sequence from nucleotide #160 to #1698 in Figure 4
and recovering and purifying from the culture medium a
protein comprising amino acid #375 to #513 of Figure 4,
substantially free from other proteinaceous materials or
other contaminating materials with which it is
co-produced. Human BMP-6 may be further characterized by
the ability to demonstrate cartilage and/or bone
formation activity in the rat bone formation assay.

Human BMP-5 is characterized by containing
substantially the entire sequence, or fragments, of the
amino acid sequence and DNA sequence disclosed in Figure
5 (SEQ ID NOs: 9 and 10). Human BMP-5 proteins are
further characterized as disulfide-linked dimers of
mature BMP-5 subunits. Recombinantly-expressed BMP-5
subunits may include protein species having heterogeneous
amino termini. One BMP-5 subunit is characterized by
comprising amino acid #329 (Ser) - #454 (His) of Figure
5. Other amino termini of BMP-5 may be selected from the
sequence of Figure 5. Modified versions, including
proteins further truncated at the amino or carboxy
termini, of BMP-5 may also be constructed by resort to
conventional mutagenic techniques.

As described in detail in United States Patent
Application SN 588,227, incorporated by reference herein,
human BMP-5 may be produced by culturing a cell
transformed with a DNA sequence comprising the nucleotide
coding sequence from nucleotide #701 to #2060 in Figure 5
and recovering and purifying from the culture medium a
protein comprising amino acid #329 to #454 of Figure 5,
substantially free from other proteinaceous materials or
other contaminating materials with which it is
co-produced. Human BMP-5 may be further characterized by
the ability to demonstrate cartilage and/or bone
formation activity in the rat bone formation assay
described in the above-referenced application.

Human BMP-8 is characterized by containing
substantially the entire sequence, or fragments, of the
amino acid sequence and DNA sequence disclosed in Figure
6. Human BMP-8 proteins may be further characterized as
disulfide-linked dimers of mature BMP-8 subunits.
Recombinantly-expressed BMP-8 subunits may include
protein species having heterogeneous amino termini. A
BMP-8 sequence or subunit sequence comprises amino acid
#143 (Ala) - #281 (His) of Figure 6 (SEQ ID NOs: 11 and
12). Other amino termini of BMP-8 may be selected from
the sequence of Figure 6. Modified versions, including
proteins further truncated at the amino or carboxy
termini, of BMP-8 may also be constructed by resort to
conventional mutagenic techniques.

As described generally in United States Patent
Application SN 525,357, incorporated by reference herein,
and as further described herein, human BMP-8 may be
produced by culturing a cell transformed with a DNA
sequence comprising the nucleotide coding sequence from
nucleotide #1 to #850 in Figure 6 and recovering and
purifying from the culture medium a protein comprising
amino acid #143 to #281 of Figure 6, or similar amino
acid sequences with heterogeneous N-termini, substantially
free from other proteinaceous materials or other
contaminating materials with which it is co-produced.
This BMP-8 may also be produced in E. coli by inserting
into a vector the sequence encoding amino acid #143 to
#281 of Figure 6 with a Met inserted before amino acid
#143. Human BMP-8 may be further characterized by the
ability to demonstrate cartilage and/or bone formation
activity in the rat bone formation assay.

Each above described BMP protein in its native,
non-reduced dimeric form may be further characterized by
an apparent molecular weight on a 12% Laemmli gel ranging
from approximately 28 kD to approximately 40 kD.

Analogs or modified versions of the DNA and amino acid
sequences described herein which provide proteins or
active fragments displaying bone stimulating or repairing
activity in the rat bone formation assay described below
in Example 9 are also classified as suitable BMPs for use
in this invention, further provided that the proteins or
fragments contain one or more Cys residues for
participation in disulfide linkages. Useful
modifications of these sequences may be made by one of
skill in the art with resort to known recombinant genetic
engineering techniques. Production of these BMP
sequences in mammalian cells produces homodimers,
generally mixtures of homodimers having heterologous N
termini. Production of these BMP sequences in E. coli
produces monomeric protein species.

Thus, according to this invention one
recombinant heterodimer of the present invention
comprises the association of a human BMP-2, including,
e.g., a monomeric strand from a mature BMP-2 subunit as
described above or an active fragment thereof, bound
through one or up to seven covalent, disulfide linkages
to a human BMP-5 including, e.g., a monomeric strand from
a mature BMP-5 subunit as described above or an active
fragment thereof. Another recombinant heterodimer of the
present invention comprises the association of a human
BMP-2, as described above, bound through one or up to
seven covalent, disulfide linkages to a human BMP-6,
including, e.g., a monomeric strand from a BMP-6 subunit
as described above or an active fragment thereof.
Another recombinant heterodimer of the present invention
comprises the association of a human BMP-2, as described
above, bound through one or up to seven covalent,
disulfide linkages to a human BMP-7, including, e.g., a
monomeric strand of a BMP-7 subunit as described above or
an active fragment thereof. Another recombinant
heterodimer of the present invention comprises the
association of a human BMP-2, as described above, bound
through one or up to seven covalent, disulfide linkages
to a human BMP-8, including, e.g., a monomeric strand of
a BMP-8 subunit as described above or an active fragment
thereof.

Still another recombinant heterodimer of the
present invention comprises the association of a human
BMP-4, including, e.g., a monomeric strand of a BMP-4
subunit as described above or an active fragment thereof,
bound through one or up to seven covalent, disulfide
linkages to a human BMP-5, as described above. Another
recombinant heterodimer of the present invention
comprises the association of a human BMP-4, as described
above, bound through one or more covalent, disulfide
linkages to a human BMP-6, as described above. Another
recombinant heterodimer of the present invention
comprises the association of a human BMP-4, as described
above, bound through one or more covalent, disulfide
linkages to a human BMP-7, as described above. Another
recombinant heterodimer of the present invention
comprises the association of a human BMP-4, as described
above, bound through one or more covalent, disulfide
linkages to a human BMP-8, as described above.

A further recombinant heterodimer of the
present invention comprises the association of a human
BMP-2, including, e.g., a monomeric strand from a mature
BMP-2 subunit as described above or an active fragment
thereof, bound through at least one disulfide linkage to
a human BMP-3 including, e.g., a monomeric strand from a
mature BMP-3 subunit as described above or an active
fragment thereof. Another recombinant heterodimer of the
present invention comprises the association of a human
BMP-2, as described above, bound through at least one
disulfide linkage to a human BMP-4, including, e.g., a
monomeric strand from a BMP-4 subunit as described above
or an active fragment thereof. Another recombinant
heterodimer of the present invention comprises the
association of a human BMP-5, as described above, bound
through at least one disulfide linkage to a human BMP-6,
including, e.g., a monomeric strand of a BMP-6 subunit as
described above or an active fragment thereof. Another
recombinant heterodimer of the present invention
comprises the association of a human BMP-5, as described
above, bound through at least one disulfide linkage to a
human BMP-7, including, e.g., a monomeric strand of a
BMP-7 subunit as described above or an active fragment
thereof. In addition, human BMP-5 may be associated with
human BMP-8, bound through at least one disulfide linkage
to a human BMP-8 subunit or active fragment thereof.

Still another recombinant heterodimer of the
present invention comprises the association of a human
BMP-6, including, e.g., a monomeric strand of a BMP-6
subunit as described above or an active fragment thereof,
bound through at least one disulfide linkage to a human
BMP-7, as described above. Another recombinant
heterodimer of the present invention comprises the
association of a human BMP-6, as described above, bound
through one or more covalent, disulfide linkages to a
human BMP-8, as described above. Another recombinant
heterodimer of the present invention comprises the
association of a human BMP-7, as described above, bound
through one or more covalent, disulfide linkages to a
human BMP-8, as described above.

The disulfide linkages formed between the
monomeric strands of the BMPs may occur between one Cys
on each strand. Disulfide linkages may likewise form
between two, three, four, five, six or seven Cys on each
BMP. These disulfide
linkages may form between adjacent Cys on each BMP or
between only selected Cys interspersed within the
respective protein sequence. Various heterodimers having
the same BMP component strands may form with different
numbers of disulfide linkages. Various heterodimers
having the same BMP component strands may form with
disulfide bonds at different Cys locations. Different
heterodimers encompassed by this invention having the
same BMP components may differ based upon their
recombinant production in mammalian cells, bacterial
cells, insect or yeast cells.

These recombinant heterodimers may be
characterized by increased alkaline phosphatase activity
in the W20 mouse stromal cell line bioassay (Example 8)
compared to the individual BMP homodimers, one strand of
which forms each heterodimer. Further, these
heterodimers are characterized by greater activity in the
W20 bioassay than is provided by simple mixtures of the
individual BMP dimers. Preliminary characterization of
heterodimers measured on the W20 bioassay has
demonstrated that heterodimers of BMP-2 with BMP-5, BMP-6
or BMP-7 are very active. Similarly, heterodimers of
BMP-4 with BMP-5, BMP-6 or BMP-7 are strongly active in
the W20 bioassay.

Heterodimers of this invention may also be
characterized by activity in bone growth and stimulation
assays. For example, a heterodimer of this invention is
also active in the rat bone formation assay described
below in Example 9. The heterodimers are also active in
the osteocalcin bioassay described in Example 8. Other
characteristics of a heterodimer of this invention
include co-precipitation with anti-BMP antibodies to the
two different constituent BMPs, as well as characteristic
results on Western blots, high pressure liquid
chromatography (HPLC) and on two-dimensional gels, with
and without reducing conditions.

One embodiment of the method of the present
invention for producing recombinant BMP heterodimers
involves culturing a suitable cell line, which has been
co-transfected with a DNA sequence coding for expression
of a first BMP or fragment thereof and a DNA sequence
coding for expression of a second BMP or fragment
thereof, under the control of known regulatory sequences.
The transformed host cells are cultured and the
heterodimeric protein recovered and purified from the
culture medium.

In another embodiment of this method, which is
the presently preferred method of expression of the
heterodimers of this invention, a single host cell, e.g.,
a CHO DUKX cell, is co-transfected with a first DNA
molecule containing a DNA sequence encoding one BMP and a
second DNA molecule containing a DNA sequence encoding a
second selected BMP. One or both plasmids contain a
selectable marker that can be used to establish stable
cell lines expressing the BMPs. These separate plasmids
containing distinct BMP genes on separate transcription
units are mixed and transfected into the CHO cells using
conventional protocols. A ratio of plasmids that gives
maximal expression of activity in the W20 assay,
generally 1:1, is determined.

For example, as described in detail in Example
3, equal ratios of a plasmid containing the first BMP and
a dihydrofolate reductase (DHFR) marker gene and another
plasmid containing a second BMP and a DHFR marker gene
can be co-introduced into DHFR-deficient CHO cells,
DUKX-BII, by calcium phosphate coprecipitation and
transfection, electroporation, microinjection, protoplast
fusion or lipofection. Individual DHFR expressing
transformants are selected for growth in alpha media with
dialyzed fetal calf serum by conventional means. DHFR+
cells containing increased gene copies can be selected
for propagation in increasing concentrations of
methotrexate (MTX) (e.g. sequential steps in 0.02, 0.1,
0.5 and 2.0 µM MTX) according to the procedures of
Kaufman and Sharp, J. Mol. Biol., 159:601-629 (1982); and
Kaufman et al, Mol. Cell Biol., 5:1750 (1983).
Expression of the heterodimer or at least one BMP linked
to DHFR should increase with increasing levels of MTX
resistance. Cells that stably express either or both
BMP/DHFR genes will survive. However, at a high
frequency, cell lines stably incorporate and express both
plasmids that were present during the initial
transfection. The conditioned medium is thereafter
harvested and the heterodimer isolated by conventional
methods and assayed for activity. This approach can be
employed with DHFR-deficient cells.

As an alternative embodiment of this method, a
DNA molecule containing one selected BMP gene may be
transfected into a stable cell line which already
expresses another selected BMP gene. For example, as
described in detail in Example 3 below, a stable CHO cell
line expressing BMP-7 with the DHFR marker (designated
7MB9) [Genetics Institute, Inc.] is transfected with a
plasmid containing BMP-2 and a second selectable marker
gene, e.g., neomycin resistance (Neo). After
transfection, the cell is cultured and suitable cells
selected by treatment with MTX and the antibiotic, G-418.
Surviving cells are then screened for the expression of
the heterodimer. This expression system has the
advantage of permitting a single step selection.

Alternative dual selection strategies using
different cell lines or different markers can also be
used. For example, the use of an adenosine deaminase
(ADA) marker to amplify the second BMP gene in a stable
CHO cell line expressing a different BMP with the DHFR
marker may be preferable, since the level of expression
can be increased using deoxycoformycin (DCF)-mediated
gene amplification. (See the ADA-containing plasmid
described in Example 1.) Alternatively, any BMP cell
line made by first using this marker can then be the
recipient of a second BMP expression vector containing a
distinct marker and selected for dual resistance and BMP
coexpression.

Still another embodiment of a method of
expressing the heterodimers of this invention includes
transfecting the host cell with a single DNA molecule
encoding multiple genes for expression either on a single
transcription unit or on separate transcription units.
Multicistronic expression involves multiple polypeptides
encoded within a single transcript, which can be
efficiently translated from vectors utilizing a leader
sequence, e.g., from the EMC virus, from poliovirus, or
from other conventional sources of leader sequences. Two
BMP genes and a selectable marker can be expressed within
a single transcription unit. For example, vectors
containing the configuration BMPx-EMC-BMPy-DHFR or
BMPx-EMC-BMPy-EMC-DHFR can be transfected into CHO cells and
selected and amplified using the DHFR marker. A plasmid
may be constructed which contains DNA sequences encoding
two different BMPs, one or more marker genes and a
suitable leader or regulatory sequence on a single
transcription unit.

Similarly, host cells may be transfected with a
single plasmid which contains separate transcription
units for each BMP. A selectable marker, e.g., DHFR, can
be contained on another transcription unit, or
alternatively as the second cistron on one or both of the
BMP genes. These plasmids may be transfected into a
selected host cell for expression of the heterodimer, and
the heterodimer isolated from the cells or culture medium
as described above.

Another embodiment of this expression method
involves cell fusion. Two stable cell lines which
express selected BMPs, such as a cell line expressing
BMP-2 (e.g., 2EG5) and a cell line expressing BMP-7
(e.g., 7MB9), developed using the DHFR/MTX gene
amplification system and expressing BMP at high levels,
as described in Example 1 and in the above incorporated
U.S. applications, can be transfected with one of several
dominant marker genes (e.g., neo^r, hygromycin^r, GPT).
After sufficient time in coculture (approximately one
day) one resultant cell line expressing one BMP and a
dominant marker can be fused with a cell line expressing
a different BMP and preferably a different marker using a
fusigenic reagent, such as polyethylene glycol, Sendai
virus or other known agent.
The resulting cell hybrids expressing both
dominant markers and DHFR can be selected using the
appropriate culture conditions, and screened for
coexpression of the BMPs or their fragments. The
selected hybrid cell contains sequences encoding both
selected BMPs, and the heterodimer is formed in the cell
and then secreted. The heterodimer is obtained from the
conditioned medium and isolated and purified therefrom by
conventional methods (see, e.g., Example 4). The
resulting heterodimer may be characterized by methods
described herein.

Cell lines generated from the approaches
described above can be used to produce co-expressed,
heterodimeric BMP polypeptides. The heterodimeric
proteins are isolated from the cell medium in a form
substantially free from other proteins with which they
are co-produced as well as from other contaminants found
in the host cells by conventional purification
techniques. The presently preferred method of production
is co-transfection of different vectors into CHO cells
and methotrexate-mediated gene amplification. Stable
cell lines may be used to generate conditioned media
containing recombinant BMP that can be purified and



WO 93/09229 



PCT/US92/09430 



assayed for in vitro and in vivo activities. For
example, the resulting heterodimer-producing cell lines
obtained by any of the methods described herein may be
screened for activity by the assays described in Examples
8 and 9, RNA expression, and protein expression by sodium
dodecyl sulfate polyacrylamide gel electrophoresis
(SDS-PAGE).

The above-described methods of co-expression of
the heterodimers of this invention utilize suitable host
cells or cell lines. Suitable cells preferably include
mammalian cells, such as Chinese hamster ovary cells
(CHO). The selection of suitable mammalian host cells
and methods for transformation, culture, amplification,
screening and product production and purification are
known in the art. See, e.g., Gething and Sambrook,
Nature, 221:620-625 (1981), or alternatively, Kaufman et
al, Mol. Cell. Biol., 5(7):1750-1759 (1985) or Howley et
al, U. S. Patent 4,419,446. Other suitable mammalian
cell lines are the CV-1 cell line, BHK cell lines and the
293 cell line. The monkey COS-1 cell line is presently
believed to be inefficient in BMP heterodimer production.

Many strains of yeast cells known to those
skilled in the art may also be available as host cells
for expression of the polypeptides of the present
invention, e.g., Saccharomyces cerevisiae. Additionally,
where desired, insect cells may be utilized as host cells
in the method of the present invention. See, e.g.,
Miller et al, Genetic Engineering, 8:277-298 (Plenum
Press 1986) and references cited therein.

Another method for producing a biologically
active heterodimeric protein of this invention may be
employed where the host cells are microbial, preferably
bacterial cells, in particular E. coli. For example, the
various strains of E. coli (e.g., HB101, MC1061) are
well-known as host cells in the field of biotechnology.
Various strains of B. subtilis, Pseudomonas, other
bacilli and the like may also be employed in this method.

This method, which may be employed to produce
monomers and dimers (both homodimers and heterodimers), is
described in European Patent Application No. 433,225,
incorporated herein by reference. Briefly, this process
involves culturing a microbial host comprising a
nucleotide sequence encoding the desired BMP protein
linked in the proper reading frame to an expression
control sequence which permits expression of the protein
and recovering the monomeric, soluble protein. Where the
protein is insoluble in the host cells, the
water-insoluble protein fraction is isolated from the host
cells and the protein is solubilized. After
chromatographic purification, the solubilized protein is
subjected to selected conditions to obtain the
biologically active dimeric configuration of the protein.
This process, which may be employed to produce the
heterodimers of this invention, is described specifically
in Example 7, for the production of a BMP-2 homodimer.

Another aspect of the present invention
provides DNA molecules or plasmid vectors for use in
expression of these recombinant heterodimers. These
plasmid vectors may be constructed by resort to known
methods and available components known to those of skill
in the art. In general, to generate a vector useful in
the methods of this invention, the DNA encoding the
desired BMP protein is transferred into one or more
appropriate expression vectors suitable for the selected
host cell.

It is presently contemplated that any
expression vector suitable for efficient expression in
mammalian cells may be employed to produce the
recombinant heterodimers of this invention in mammalian
host cells. Preferably the vectors contain the selected
BMP DNA sequences described above and in the Figures,
which encode selected BMP components of the heterodimer.
Alternatively, vectors incorporating modified sequences
as described in the above-referenced patent applications
are also embodiments of the present invention and useful
in the production of the vectors.

In addition to the specific vectors described
in Example 1, one skilled in the art can construct
mammalian expression vectors by employing the sequence of
Figures 1-6 or other DNA sequences containing the coding
sequences of Figures 1-6 (SEQ ID NOs: 1, 3, 5, 7, 9 and
11), or other modified sequences and known vectors, such
as pCD [Okayama et al, Mol. Cell Biol., 2:161-170 (1982)]
and pJL3, pJL4 [Gough et al, EMBO J., 4:645-653 (1985)].
The BMP DNA sequences can be modified by removing the
non-coding nucleotides on the 5' and 3' ends of the
coding region. The deleted non-coding nucleotides may or
may not be replaced by other sequences known to be
beneficial for expression. The transformation of these
vectors into appropriate host cells as described above
can produce desired heterodimers.

One skilled in the art could manipulate the
sequences of Figures 1-6 by eliminating or replacing the
mammalian regulatory sequences flanking the coding
sequence with, e.g., yeast or insect regulatory sequences,
to create vectors for intracellular or extracellular
expression by yeast or insect cells. [See, e.g.,
procedures described in published European Patent
Application 155,476 for expression in insect cells; and
procedures described in published PCT application
WO86/00639 and European Patent Application EPA 123,289
for expression in yeast cells.]

Similarly, bacterial sequences and preference
codons may replace sequences in the described and
exemplified mammalian vectors to create suitable
expression systems for use in the production of BMP
monomers in the method described above. For example, the
coding sequences could be further manipulated (e.g.,
ligated to other known linkers or modified by deleting
non-coding sequences therefrom or altering nucleotides
therein by other known techniques). The modified BMP
coding sequences could then be inserted into a known
bacterial vector using procedures such as described in T.
Taniguchi et al, Proc. Natl. Acad. Sci. USA, 77:5230-5233
(1980). The exemplary bacterial vector could then be
transformed into bacterial host cells and BMP
heterodimers expressed thereby. An exemplary vector for
microbial, e.g., bacterial, expression is described below
in Example 7.

Other vectors useful in the methods of this
invention may contain multiple genes in a single
transcription unit. For example, a proposed plasmid
p7E2D contains the BMP-7 gene followed by the EMC leader
sequence, followed by the BMP-2 gene, followed by the
DHFR marker gene. Another example is plasmid p7E2ED
which contains the BMP-7 gene, the EMC leader, the BMP-2
gene, another EMC leader sequence and the DHFR marker
gene. Alternatively, the vector may contain more than
one transcription unit. As one example, the plasmid
p2ED7ED contains a transcription unit for BMP-2 and a
separate transcription unit for BMP-7, i.e.,
BMP-2-EMC-DHFR and BMP-7-EMC-DHFR. Alternatively, each
transcription unit on the plasmid may contain a different
marker gene. For example, plasmid p2EN7ED contains
BMP-2-EMC-Neo and BMP-7-EMC-DHFR.
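
The four plasmid layouts named above can be written down as plain data, which makes the difference between a single polycistronic transcription unit and separate units easy to see. The following is only an illustrative sketch: the dictionary, the `summarize` helper and the choice to model each transcription unit as an ordered list are assumptions made here, not part of the described constructs.

```python
# Illustrative data model (hypothetical): each plasmid is a list of
# transcription units; each unit lists its elements in 5'->3' order,
# using the element names given in the text.
PLASMIDS = {
    "p7E2D":   [["BMP-7", "EMC", "BMP-2", "DHFR"]],
    "p7E2ED":  [["BMP-7", "EMC", "BMP-2", "EMC", "DHFR"]],
    "p2ED7ED": [["BMP-2", "EMC", "DHFR"], ["BMP-7", "EMC", "DHFR"]],
    "p2EN7ED": [["BMP-2", "EMC", "Neo"], ["BMP-7", "EMC", "DHFR"]],
}

MARKERS = {"DHFR", "Neo"}  # selectable markers mentioned for these plasmids


def summarize(units):
    """Count transcription units and list the distinct BMP genes and markers."""
    elements = [element for unit in units for element in unit]
    return {
        "transcription_units": len(units),
        "bmps": sorted({e for e in elements if e.startswith("BMP-")}),
        "markers": sorted({e for e in elements if e in MARKERS}),
    }


if __name__ == "__main__":
    for name, units in PLASMIDS.items():
        print(name, summarize(units))
```

Every layout carries both BMP-2 and BMP-7; only p2EN7ED carries two different markers, which is what would allow dual selection from a single plasmid.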
Additionally, the vectors also contain
appropriate expression control sequences which are
capable of directing the replication and expression of
the BMP in the selected host cells. Useful regulatory
sequences for such vectors are known to one of skill in
the art and may be selected depending upon the selected
host cells. Such selection is routine and does not form
part of the present invention. Similarly, the vectors
may contain one or more selection markers, such as the
antibiotic resistance gene, Neo, or selectable markers
such as DHFR and ADA. The presently preferred marker
gene is DHFR. These marker genes may also be selected by
one of skill in the art.

Once they are expressed by one of the methods
described above, the heterodimers of this invention may
be identified and characterized by application of a
variety of assays and procedures. A co-precipitation
(immunoprecipitation) assay may be performed with
antibodies to each of the BMPs forming the heterodimer.
Generally antibodies for this use may be developed by
conventional means, e.g., using the selected BMP,
fragments thereof, or synthetic BMP peptides as antigen.
Antibodies employed in assays are generally polyclonal
antibodies made from individual BMP peptides or proteins
injected into rabbits according to classical techniques.
This assay is performed conventionally, and permits the
identification of the heterodimer, which is precipitated
by antibodies to both BMP components of the heterodimer.
In contrast, only one of the two antibodies causes
precipitation of any homodimeric form which may be
produced in the process of producing the heterodimer.
Another characterizing assay is a Western
assay, employing a precipitating antibody, a probing
antibody and a detecting antibody. This assay may also
be performed conventionally, by using an antibody to one
of the BMPs to precipitate the dimers, which are run on
reducing SDS-PAGE for Western analysis. An antibody to
the second BMP is used to probe the precipitates on the
Western gel for the heterodimer. A detecting antibody,
such as a goat anti-rabbit antibody labelled with
horseradish peroxidase (HRP), is then applied, which will
reveal the presence of one of the component subunits of
the heterodimer.

Finally, the specific activity of the
heterodimer may be quantitated as described in detail in
Example 6. Briefly, the amount of each BMP is
quantitated using Western blot analysis or pulse
labelling and SDS-PAGE analysis in samples of each BMP
homodimer and the heterodimer. The W20 activity is also
determined as described specifically in Example 8. The
relative specific activities may be calculated by the
formula: W20 alkaline phosphatase activity / amount of BMP
on Western blot or by fluorography. As one example, this
formula has been determined for the BMP-2/7 heterodimer,
demonstrating that the heterodimer has an estimated 5 to
50 fold higher specific activity than the BMP-2
homodimer.
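
The ratio in the preceding paragraph can be illustrated with a short calculation. This is a minimal sketch under stated assumptions: the function names are introduced here for illustration, and the input values are hypothetical placeholders in arbitrary units, not measurements from this specification.

```python
def specific_activity(w20_ap_activity, bmp_amount):
    """W20 alkaline phosphatase activity divided by the amount of BMP.

    Both inputs are in arbitrary but consistent units; the amount would come
    from Western blot or fluorography quantitation as described in the text.
    """
    if bmp_amount <= 0:
        raise ValueError("bmp_amount must be positive")
    return w20_ap_activity / bmp_amount


def fold_difference(reference, sample):
    """How many times higher the sample's specific activity is than the reference's."""
    return sample / reference


if __name__ == "__main__":
    # Hypothetical numbers chosen only to show the arithmetic.
    homodimer = specific_activity(w20_ap_activity=4.0, bmp_amount=8.0)
    heterodimer = specific_activity(w20_ap_activity=10.0, bmp_amount=2.0)
    print(fold_difference(homodimer, heterodimer))
```

Because both measurements are relative, only the ratio between two samples assayed under the same conditions is meaningful, which is why the result is reported as a fold difference.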

The heterodimers of the present invention may
have a variety of therapeutic and pharmaceutical uses,
e.g., in compositions for wound healing, tissue repair,
and in similar compositions which have been indicated for
use of the individual BMPs. Increased potency of the
heterodimers over the individual BMPs may permit lower
dosages of the compositions in which they are contained
to be administered to a patient in comparison to dosages
of compositions containing only a single BMP. A
heterodimeric protein of the present invention, which
induces cartilage and/or bone growth in circumstances
where bone is not normally formed, has application in the
healing of bone fractures and cartilage defects in humans
and other animals. Such a preparation employing a
heterodimeric protein of the invention may have
prophylactic use in closed as well as open fracture
reduction and also in the improved fixation of artificial
joints. De novo bone formation induced by an osteogenic
agent contributes to the repair of congenital, trauma
induced, or oncologic resection induced craniofacial
defects, and also is useful in cosmetic plastic surgery.
A heterodimeric protein of this invention may
be used in the treatment of periodontal disease, and in
other tooth repair processes. Such agents may provide an
environment to attract bone-forming cells, stimulate
growth of bone-forming cells or induce differentiation of
progenitors of bone-forming cells. Heterodimeric
polypeptides of the invention may also be useful in the
treatment of osteoporosis. A variety of osteogenic,
cartilage-inducing and bone inducing factors have been
described. See, e.g., European Patent Applications
148,155 and 169,016 for discussions thereof.

The proteins of the invention may also be used
in wound healing and related tissue repair. The types of
wounds include, but are not limited to, burns, incisions
and ulcers. (See, e.g., PCT Publication WO84/01106
incorporated by reference herein for discussion of wound
healing and related tissue repair).

Additionally, the proteins of the invention may
increase neuronal survival and therefore be useful in
transplantation and treatment of conditions exhibiting a
decrease in neuronal survival.

In view of the usefulness of the heterodimers,
therefore, a further aspect of the invention is a
therapeutic method and composition for repairing
fractures and other conditions related to cartilage
and/or bone defects or periodontal diseases. In
addition, the invention comprises therapeutic methods and
compositions for wound healing and tissue repair. Such
compositions comprise a therapeutically effective amount
of a heterodimeric protein of the invention in admixture
with a pharmaceutically acceptable vehicle, carrier or
matrix. The preparation and formulation of such
physiologically acceptable protein compositions, having
due regard to pH, isotonicity, stability and the like, is
within the skill of the art.

It is expected that the proteins of the
invention may act in concert with other related proteins
and growth factors. Therapeutic methods and compositions
of the invention therefore comprise a therapeutic amount
of a heterodimeric protein of the invention with a
therapeutic amount of at least one of the other BMP
proteins disclosed in co-owned and concurrently filed U.
S. applications described above. Such combinations may
comprise separate molecules of the BMP proteins or other
heteromolecules of the present invention.

In further compositions, heterodimeric proteins
of the invention may be combined with other agents
beneficial to the treatment of the bone and/or cartilage
defect, wound, or tissue in question. These agents
include various growth factors such as epidermal growth
factor (EGF), platelet derived growth factor (PDGF),
transforming growth factors (TGF-α and TGF-β), and
insulin-like growth factor (IGF).

The therapeutic compositions are also presently
valuable for veterinary applications due to the lack of
species specificity in BMP proteins. Particularly
domestic animals and thoroughbred horses, in addition to
humans, are desired patients for such treatment with
heterodimeric proteins of the present invention.

The therapeutic method includes administering
the composition topically, systemically, or locally as
an implant or device. When administered, the therapeutic
composition for use in this invention is, of course, in a
pyrogen-free, physiologically acceptable form. Further,
the composition may desirably be encapsulated or injected
in a viscous form for delivery to the site of bone,
cartilage or tissue damage. Topical administration may
be suitable for wound healing and tissue repair.
Therapeutically useful agents other than the
heterodimeric proteins of the invention, which may also
optionally be included in the composition as described
above, may alternatively or additionally be administered
simultaneously or sequentially with the heterodimeric BMP
composition in the methods of the invention. Preferably
for bone and/or cartilage formation, the composition
would include a matrix capable of delivering the
heterodimeric protein-containing composition to the site
of bone and/or cartilage damage, providing a structure
for the developing bone and cartilage and optimally
capable of being resorbed into the body. Such matrices
may be formed of materials presently in use for other
implanted medical applications.

The choice of matrix material is based on
biocompatibility, biodegradability, mechanical
properties, cosmetic appearance and interface properties.
The particular application of the heterodimeric BMP
compositions will define the appropriate formulation.
Potential matrices for the compositions may be
biodegradable and chemically defined calcium sulfate,
tricalciumphosphate, hydroxyapatite, polylactic acid,
polyglycolic acid and polyanhydrides. Other potential
materials are biodegradable and biologically well
defined, such as bone or dermal collagen. Further
matrices are comprised of pure proteins or extracellular
matrix components. Other potential matrices are
nonbiodegradable and chemically defined, such as sintered
hydroxyapatite, bioglass, aluminates, or other ceramics.
Matrices may be comprised of combinations of any of the
above mentioned types of material, such as polylactic
acid and hydroxyapatite or collagen and
tricalciumphosphate. The bioceramics may be altered in
composition, such as in calcium-aluminate-phosphate, and
processing to alter pore size, particle size, particle
shape, and biodegradability.

Presently preferred is a 50:50 (mole weight)
copolymer of lactic acid and glycolic acid in the form of
porous particles having diameters ranging from 150 to 800
microns. In some applications, it will be useful to
utilize a sequestering agent, such as carboxymethyl
cellulose or autologous blood clot, to prevent the BMP
compositions from dissociating from the matrix.

The dosage regimen of a heterodimeric
protein-containing pharmaceutical composition will be determined
by the attending physician considering various factors
which modify the action of the heterodimeric proteins,
e.g., amount of bone weight desired to be formed, the site
of bone damage, the condition of the damaged bone, the
size of a wound, type of damaged tissue, the patient's
age, sex, and diet, the severity of any infection, time
of administration and other clinical factors. The dosage
may vary with the type of matrix used in the
reconstitution and the BMP proteins in the heterodimer
and any additional BMP or other proteins in the
pharmaceutical composition. For example, the addition of
other known growth factors, such as IGF I (insulin-like
growth factor I), to the final composition may also
affect the dosage. Progress can be monitored by periodic
assessment of bone growth and/or repair, for example,
X-rays, histomorphometric determinations and tetracycline
labeling.

The following examples are illustrative of the
present invention and do not limit its scope.



EXAMPLE 1 - BMP Vector Constructs and Cell Lines 
A. BMP-2 Vectors

The mammalian expression vector pMT2 CXM
is a derivative of p91023(b) [Wong et al, Science,
221:810-815 (1985)] differing from the latter in that it
contains the ampicillin resistance gene (Amp) in place of
the tetracycline resistance gene (Tet) and further
contains a XhoI site for insertion of cDNA clones. The
functional elements of pMT2 CXM have been described [R.
J. Kaufman, Proc. Natl. Acad. Sci. USA, 82:689-693
(1985)] and include the adenovirus VA genes, the SV40
origin of replication including the 72 bp enhancer, the
adenovirus major late promoter including a 5' splice site
and the majority of the adenovirus tripartite leader
sequence present on adenovirus late mRNAs, a 3' splice
acceptor site, a DHFR insert, the SV40 early
polyadenylation site (SV40), and pBR322 sequences needed
for propagation in E. coli.

EcoRI digestion of pMT2-VWF, which has
been deposited with the American Type Culture Collection
(ATCC), Rockville, MD (USA) under accession number ATCC
67122, excises the cDNA insert present in pMT2-VWF,
yielding pMT2 in linear form. Plasmid pMT2 can be
ligated and used to transform E. coli HB101 or DH-5 to
ampicillin resistance. Plasmid pMT2 DNA can be prepared
by conventional methods.

Plasmid pMT2 CXM is then constructed using
loopout/in mutagenesis [Morinaga et al, Biotechnology,
84:636 (1984)]. This removes bases 1075 to 1145 relative
to the HindIII site near the SV40 origin of replication
and enhancer sequences of pMT2. In addition it inserts
the following sequence:



5' PO4-CATGGGCAGCTCGAG-3' (SEQ ID NO: 15)

at nucleotide 1145. This sequence contains the
recognition site for the restriction endonuclease XhoI.
A derivative of pMT2 CXM, termed plasmid pMT23,
contains recognition sites for the restriction
endonucleases PstI, EcoRI, SalI and XhoI.

Full length BMP-2 cDNA (Fig. 1) (SEQ ID NO: 1)
is released from the λGT10 vector by digestion with EcoRI
and subcloned into pSP65 [Promega Biotec, Madison,
Wisconsin; see, e.g., Melton et al, Nucl. Acids Res.,
12:7035-7056 (1984)] in both orientations yielding pBMP-2
#39-3 or pBMP-2 #39-4.

The majority of the untranslated regions of the
BMP-2 cDNA are removed in the following manner. The 5'
sequences are removed between the SalI site in the
adapter (present from the original cDNA cloning) and the
SalI site 7 base pairs upstream of the initiator ATG by
digestion of the pSP65 plasmid containing the BMP-2 cDNA
with SalI and religation. The 3' untranslated region is
removed using heteroduplex mutagenesis using the
oligonucleotide

5' GAGGGTTGTGGGTGTCGC TAG TGA GTCGAC TACAGCAAAATT 3'
                      End     SalI
(SEQ ID NO: 16)

The sequence contains the terminal 3' coding region of
the BMP-2 cDNA, followed immediately by a recognition
site for SalI. The sequence introduces a SalI site
following the termination (TAG) codon.

The SalI fragment of this clone was subcloned
into the expression vector pMT23, yielding the vector
pMT23-BMP2aUT. Restriction enzyme sites flank the BMP-2
coding region in the sequence PstI-EcoRI-SalI-BMP-2
cDNA-SalI-EcoRI-XhoI.

The expression plasmid pED4 [Kaufman et al,
Nucl. Acids Res., 19:4485-4490 (1991)] was linearized by
digestion with EcoRI and treated with calf intestinal
phosphatase. The BMP-2 cDNA gene was excised from
pMT23-BMP2aUT by digestion with EcoRI and recovery of the 1.2
kb fragment by electrophoresis through a 1.0% low melt
agarose gel. The linearized pED4 vector and the EcoRI
BMP-2 fragment were ligated together, yielding the BMP-2
expression plasmid pBMP2A-EMC.

Another vector pBMP-2A-EN contains the same
sequences contained within the vector pBMP2A-EMC, except
the DHFR gene has been replaced by conventional means
with the neomycin resistance gene from the Tn5
transposable element.

B. BMP-4 Vectors

A BMP-4 cDNA sequence set forth in Figure
2 (SEQ ID NO: 3), in which the 3' untranslated region is
removed, is made via heteroduplex mutagenesis with the
mutagenic oligonucleotide:

5' GGATGTGGGTGCCGC TGA CTCTAGAGTCGACG GAATTC 3'
                   End                EcoRI
(SEQ ID NO: 17)

This deletes all of the sequences 3' to the translation
terminator codon of the BMP-4 cDNA, juxtaposing this
terminator codon and the vector polylinker sequences.
This step is performed in an SP65 vector [Promega
Biotech] and may also be conveniently performed in
pMT2-derivatives containing the BMP-4 cDNA similar to the BMP2
vectors described above. The 5' untranslated region is
removed using the restriction endonuclease BsmI, which
cleaves within the eighth codon of BMP-4 cDNA.

Reconstruction of the first eight codons
is accomplished by ligation to oligonucleotides:

   EcoRI   Initiator       BsmI
5' AATTCACCATGATTCCTGGTAACCGAATGCT 3' (SEQ ID NO: 18)
and
3'     GTGGTACTAAGGACCATTGGCTTAC 5' (SEQ ID NO: 19)

These oligonucleotides form a duplex which has a BsmI
complementary cohesive end capable of ligation to the
BsmI restricted BMP-4 cDNA, and it has an EcoRI
complementary cohesive end capable of ligation to the
EcoRI restricted vector pMT2. Thus the cDNA for BMP-4
with the 5' and 3' untranslated regions deleted, and
retaining the entire encoding sequence, is contained
within an EcoRI restriction fragment of approximately 1.2
kb.
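
The cohesive ends of the duplex formed by SEQ ID NOs: 18 and 19 can be checked by aligning the two strands. The sketch below is illustrative only: the helper name `duplex_ends` is introduced here, and the two sequences are transcribed from the oligonucleotides listed above. Complementing the lower strand as written (3' to 5') gives the stretch of the upper strand it pairs with; whatever is left over on either side of the upper strand is single-stranded.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

TOP_5_TO_3 = "AATTCACCATGATTCCTGGTAACCGAATGCT"   # SEQ ID NO: 18, written 5'->3'
BOTTOM_3_TO_5 = "GTGGTACTAAGGACCATTGGCTTAC"      # SEQ ID NO: 19, written 3'->5'


def duplex_ends(top_5_to_3, bottom_3_to_5):
    """Return (5' overhang, paired region, 3' overhang) of the top strand."""
    # Complementing (without reversing) a strand written 3'->5' yields the
    # top-strand bases it pairs with, read 5'->3'.
    paired = bottom_3_to_5.translate(COMPLEMENT)
    start = top_5_to_3.find(paired)
    if start < 0:
        raise ValueError("strands are not complementary as written")
    return top_5_to_3[:start], paired, top_5_to_3[start + len(paired):]


if __name__ == "__main__":
    five_prime, paired, three_prime = duplex_ends(TOP_5_TO_3, BOTTOM_3_TO_5)
    print("5' overhang:", five_prime)
    print("paired region:", paired, len(paired))
    print("3' overhang:", three_prime)
```

The unpaired AATT at the 5' end is the kind of overhang that matches an EcoRI-cut vector, and the two unpaired bases at the 3' end follow the GAATGC BsmI recognition sequence, consistent with the BsmI-compatible end the text describes.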

The pMT2 CXM plasmid containing this BMP-4
sequence is designated pXMBMP-4aUT. It is digested with
EcoRI in order to release the BMP-4 cDNA containing
insert from the vector. This insert is subcloned into
the EcoRI site of the mammalian expression vector pED4,
resulting in pBMP4A-EMC.

C. BMP-5 Vectors

A BMP-5 cDNA sequence comprising the
nucleotide sequence from nucleotide #699 to #2070 of Fig.
5 (SEQ ID NO: 9) is specifically amplified as follows.
The oligonucleotides CGACCTGCAGCCACCATGCATCTGACTGTA (SEQ
ID NO: 20) and TGCCTGCAGTTTAATATTAGTGGCAGC (SEQ ID NO:
21) are utilized as primers to allow the amplification of
nucleotide sequence #699 to #2070 of Fig. 5 from the BMP-5
insert of λ-ZAP clone U2-16 [ATCC #68109]. This
procedure introduces the nucleotide sequence
CGACCTGCAGCCACC (SEQ ID NO: 22) immediately preceding
nucleotide #699 and the nucleotide sequence CTGCAGGCA
immediately following nucleotide #2070. The addition of
these sequences results in the creation of PstI
restriction endonuclease recognition sites at both ends
of the amplified DNA fragment. The resulting amplified
DNA product of this procedure is digested with the
restriction endonuclease PstI and subcloned into the PstI
site of the pMT2 derivative pMT21 [Kaufman, Nucl. Acids
Res., 19:4485-4490 (1991)]. The resulting clone is
designated H5/5/pMT.
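
The relationship between the two primers and the sequences they add can be confirmed with a few string operations. This is a hedged sketch for illustration only. The variable and function names are hypothetical, and the only sequence data used are the primers and added sequences quoted in the paragraph above.

```python
# Illustrative check only; names are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")
PSTI_SITE = "CTGCAG"

FORWARD_PRIMER = "CGACCTGCAGCCACCATGCATCTGACTGTA"  # SEQ ID NO: 20
REVERSE_PRIMER = "TGCCTGCAGTTTAATATTAGTGGCAGC"     # SEQ ID NO: 21
FIVE_PRIME_ADDITION = "CGACCTGCAGCCACC"            # SEQ ID NO: 22
THREE_PRIME_ADDITION = "CTGCAGGCA"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# The reverse primer anneals to the top strand, so the end of the amplified
# top strand is the reverse complement of the reverse primer.
top_strand_tail = reverse_complement(REVERSE_PRIMER)

checks = {
    "forward primer starts with the 5' addition":
        FORWARD_PRIMER.startswith(FIVE_PRIME_ADDITION),
    "amplified top strand ends with the 3' addition":
        top_strand_tail.endswith(THREE_PRIME_ADDITION),
    "PstI site in the 5' addition": PSTI_SITE in FIVE_PRIME_ADDITION,
    "PstI site in the 3' addition": PSTI_SITE in THREE_PRIME_ADDITION,
}
for description, ok in checks.items():
    print(f"{description}: {ok}")
```

Each added sequence contains the PstI recognition sequence CTGCAG, consistent with the statement that PstI sites are created at both ends of the amplified fragment.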

The insert of H5/5/pMT is excised by PstI
digestion and subcloned into the plasmid vector pSP65
[Promega Biotech] at the PstI site, resulting in plasmid
BMP5/SP6. BMP5/SP6 and U2-16 are digested with the
restriction endonucleases NsiI and NdeI to excise the
portion of their inserts corresponding to nucleotides
#704 to #1876 of Fig. 5. The resulting 1173 nucleotide
NsiI-NdeI fragment of clone U2-16 is ligated into the
NsiI-NdeI site of BMP5/SP6 from which the corresponding
1173 nucleotide NsiI-NdeI fragment had been removed. The
resulting clone is designated BMP5mix/SP65.

Direct DNA sequence analysis of BMP5mix/SP65 is
performed to confirm identity of the nucleotide sequences
produced by the amplification to those set forth in Fig.
5. The clone BMP5mix/SP65 is digested with the
restriction endonuclease PstI resulting in the excision
of an insert comprising the nucleotides #699 to #2070 of
Fig. 5 and the additional sequences containing the PstI
recognition sites as described above. The resulting 1382
nucleotide PstI fragment is subcloned into the PstI site
of the pMT2 derivative pMT21. This clone is designated
BMP5mix/pMT21#2.

The same fragment is also subcloned into the
PstI site of pED4 to yield the vector designated
BMP5mix-EMC-11.

D. BMP-6 Vectors

A BMP-6 cDNA sequence comprising the
nucleotide sequence from nucleotide #160 to #1706 of
Fig. 4 (SEQ ID NO: 7) is produced by a series of
techniques known to those skilled in the art. The clone
BMP6C35 [ATCC 68245] is digested with the restriction
endonucleases ApaI and TaqI, resulting in the excision of
a 1476 nucleotide portion of the insert comprising
nucleotide #231 to #1703 of Fig. 4. Synthetic
oligonucleotides with SalI restriction endonuclease site
converters are designed to replace those nucleotides
corresponding to #160 to #230 and #1704 to #1706 which
are not contained in the 1476 ApaI-TaqI fragment of the
BMP-6 cDNA sequence.

Oligonucleotide/SalI converters conceived to
replace the missing 5'
(TCGACCCACCATGCCGGGGCTGGGGCGGAGGGCGCAGTGGCTGT
GCTGGTGGTGGGGGCTGTGCTGCAGCTGCTGCGGGCC (SEQ ID NO: 23) and
CGCAGCAGCTGCACAGCAGCCCCCACCACCAGCACAGCCACTGCGCCCTCCGCCCCA
GCCCCGGCATGGTGGG (SEQ ID NO: 24)) and 3' (TCGACTGGTTT
(SEQ ID NO: 25) and CGAAACCAG (SEQ ID NO: 26)) sequences
are annealed to each other independently. The annealed
5' and 3' converters are then ligated to the 1476
nucleotide ApaI-TaqI fragment described above, creating a
1563 nucleotide fragment comprising the nucleotide sequence
from #160 to #1706 of Fig. 4 and the additional sequences
contrived to create SalI restriction endonuclease sites
at both ends. The resulting 1563 nucleotide fragment is
subcloned into the SalI site of pSP64 [Promega Biotech,
Madison, WI]. This clone is designated BMP6/SP64#15.

DNA sequence analysis of BMP6/SP64#15 is
performed to confirm identity of the 5' and 3' sequences
replaced by the converters to the sequence set forth in
Fig. 4. The insert of BMP6/SP64#15 is excised by
digestion with the restriction endonuclease SalI. The
resulting 1563 nucleotide SalI fragment is subcloned into
the XhoI restriction endonuclease site of pMT21 and
designated herein as BMP6/pMT21.

The PstI site of pED4 is converted to a SalI
site by digestion of the plasmid with PstI and ligation
to the converter oligonucleotides:

5'-TCGACAGGCTCGCCTGCA-3' (SEQ ID NO: 27) and
3'-GTCCGAGCGG-5' (SEQ ID NO: 28).

The above 1563 nucleotide SalI fragment is also subcloned
into the SalI site of this pED4 vector, yielding the
expression vector BMP6/EMC.
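
How the converter turns a PstI end into a SalI-compatible end can be seen from the single-stranded extensions of the annealed pair. The sketch below is illustrative only. The function names are hypothetical, and it assumes SEQ ID NO: 28 is written 3' to 5' as printed.

```python
# Illustrative check only; helper names are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

CONVERTER_TOP = "TCGACAGGCTCGCCTGCA"    # SEQ ID NO: 27, 5'->3'
CONVERTER_BOTTOM_3_TO_5 = "GTCCGAGCGG"  # SEQ ID NO: 28, as printed (3'->5')


def single_stranded_ends(top: str, bottom_3_to_5: str):
    """Return the unpaired 5' and 3' extensions of the top strand."""
    paired = bottom_3_to_5.translate(COMPLEMENT)
    start = top.find(paired)
    if start < 0:
        raise ValueError("strands are not complementary")
    return top[:start], top[start + len(paired):]


def is_self_complementary(seq: str) -> bool:
    """True if the sequence equals its own reverse complement."""
    return seq == seq.translate(COMPLEMENT)[::-1]


five_end, three_end = single_stranded_ends(CONVERTER_TOP, CONVERTER_BOTTOM_3_TO_5)
print("5' extension:", five_end, "self-complementary:", is_self_complementary(five_end))
print("3' extension:", three_end, "self-complementary:", is_self_complementary(three_end))
```

The 3' extension TGCA matches the 3' overhang left by PstI (CTGCA^G), so the converter can anneal to the PstI-cut vector, while the 5' extension TCGA matches the 5' overhang left by SalI (G^TCGAC) and can accept a SalI fragment.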

E. BMP-7 Vectors

A BMP-7 sequence comprising the nucleotide
sequence from nucleotide #97 to #1402 of Fig. 3 (SEQ ID
NO: 5) is specifically amplified as follows. The
oligonucleotides CAGGTCGACCCACCATGCACGTGCGCTCA (SEQ ID
NO: 29) and TCTGTCGACCTCGGAGGAGCTAGTGGC (SEQ ID NO: 30)
are utilized as primers to allow the amplification of
nucleotide sequence #97 to #1402 of Fig. 3 from the
insert of clone PEH7-9 [ATCC #68182]. This procedure
generates the insertion of the nucleotide sequence
CAGGTCGACCCACC immediately preceding nucleotide #97 and
the insertion of the nucleotide sequence GTCGACAGA
immediately following nucleotide #1402. The addition of
these sequences results in the creation of a SalI
restriction endonuclease recognition site at each end of
the amplified DNA fragment. The resulting amplified DNA
product of this procedure is digested with the
restriction endonuclease SalI and subcloned into the SalI
site of the plasmid vector pSP64 [Promega Biotech,
Madison, WI] resulting in BMP7/SP6#2.

The clones BMP7/SP6#2 and PEH7-9 are digested
with the restriction endonucleases NcoI and StuI to
excise the portion of their inserts corresponding to
nucleotides #363 to #1081 of Fig. 3. The resulting 719
nucleotide NcoI-StuI fragment of clone PEH7-9 is ligated
into the NcoI-StuI site of BMP7/SP6#2 from which the
corresponding 719 nucleotide fragment has been removed. The
resulting clone is designated BMP7mix/SP6.

Direct DNA sequence analysis of BMP7mix/SP6
confirmed identity of the 3' region to the nucleotide
sequence from #1082 to #1402 of Fig. 3; however, the 5'
region contained one nucleotide misincorporation.

Amplification of the nucleotide sequence (#97
to #1402 of Fig. 3) utilizing PEH7-9 as a template is
repeated as described above. The resulting amplified DNA
product of this procedure is digested with the
restriction endonucleases SalI and PstI. This digestion
results in the excision of a 747 nucleotide fragment
comprising nucleotide #97 to #833 of Fig. 3 plus the
additional sequences of the 5' priming oligonucleotide
used to create the SalI restriction endonuclease
recognition site described earlier. This 747 SalI-PstI
fragment is subcloned into a SalI-PstI digested pSP65
[Promega Biotech, Madison, WI] vector resulting in
5'BMP7/SP65. DNA sequence analysis demonstrates that the
insert of the 5'BMP7/SP65#1 comprises a sequence
identical to nucleotide #97 to #362 of Fig. 3.

The clones BMP7mix/SP6 and 5'BMP7/SP65 are
digested with the restriction endonucleases SalI and
NcoI. The resulting 3' NcoI-SalI fragment of BMP7mix/SP6
comprising nucleotides #363 to #1402 of Fig. 3 and 5'
SalI-NcoI fragment of 5'BMP7/SP65 comprising nucleotides
#97 to #362 of Fig. 3 are ligated together at the NcoI
restriction sites to produce a 1317 nucleotide fragment
comprising nucleotides #97 to #1402 of Fig. 3 plus the
additional sequences derived from the 5' and 3'
oligonucleotide primers which allow the creation of SalI
restriction sites at both ends of this fragment.
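
The fragment sizes quoted for the BMP-7 constructs follow from the nucleotide coordinates and the position of the SalI cut within the added primer sequences. The sketch below reproduces that arithmetic. It is an illustration under two stated assumptions: sizes are counted along the top strand only, and SalI cleaves after the first G of GTCGAC. All function names are hypothetical.

```python
# Illustrative arithmetic only; helper names are hypothetical.
SALI_SITE = "GTCGAC"
SALI_CUT_OFFSET = 1  # SalI cleaves G^TCGAC on the top strand

FIVE_PRIME_ADDITION = "CAGGTCGACCCACC"  # inserted before nucleotide #97
THREE_PRIME_ADDITION = "GTCGACAGA"      # inserted after nucleotide #1402


def span_length(first: int, last: int) -> int:
    """Number of nucleotides from #first to #last, inclusive."""
    return last - first + 1


def kept_downstream_of_cut(addition: str) -> int:
    """Bases of a 5' addition that stay on the fragment after SalI cleavage."""
    cut = addition.index(SALI_SITE) + SALI_CUT_OFFSET
    return len(addition) - cut


def kept_upstream_of_cut(addition: str) -> int:
    """Bases of a 3' addition that stay on the fragment after SalI cleavage."""
    return addition.index(SALI_SITE) + SALI_CUT_OFFSET


left = kept_downstream_of_cut(FIVE_PRIME_ADDITION)   # bases TCGACCCACC
right = kept_upstream_of_cut(THREE_PRIME_ADDITION)   # base G

sali_fragment = left + span_length(97, 1402) + right
sali_psti_fragment = left + span_length(97, 833)
ncoi_stui_fragment = span_length(363, 1081)
joined_halves = span_length(97, 362) + span_length(363, 1402)

print(sali_fragment, sali_psti_fragment, ncoi_stui_fragment, joined_halves)
```

Under these assumptions the computed values agree with the 1317, 747 and 719 nucleotide sizes given in the text, and the two NcoI-joined halves together span the full #97 to #1402 region.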

This 1317 nucleotide SalI fragment is ligated into the
SalI site of the pMT2 derivative pMT2Cla-2. pMT2Cla-2 is
constructed by digesting pMT21 with EcoRV and XhoI,
treating the digested DNA with Klenow fragment of DNA
polymerase I and ligating ClaI linkers (NEBio Labs,
CATCGATG). This removes bases 2171 to 2420 starting from
the HindIII site near the SV40 origin of replication and
enhancer sequences of pMT2 and introduces a unique ClaI
site, but leaves the adenovirus VAI gene intact,
resulting in pMT2Cla-2. This clone is designated
BMP-7-pMT2.

The insert of BMP-7-pMT2 is excised by
digestion with the restriction endonuclease SalI. The
resulting 1317 nucleotide SalI fragment is subcloned into
the XhoI restriction endonuclease site of pMT21 to yield
the clone BMP-7/pMT21. This SalI fragment is also
subcloned into the SalI site of the pED4 vector in which
the PstI site was converted into a SalI site as described
above, resulting in the vector pBMP7/EMC#4.

F. BMP-8 Vectors

At present no mammalian BMP-8 vectors have
been constructed. However, using the sequence of Figure
6 (SEQ ID NO: 11), it is contemplated that vectors
similar to those described above for the other BMPs may
be readily constructed. A bacterial expression vector
similar to the BMP-2 vector described in detail in
Example 7 may also be constructed for BMP-8, by
introducing a Met before the amino acid #284 Ala of Fig.
6. This sequence of BMP-8 is inserted into the vector
pALBP2-781 in place of the BMP-2 sequence. See Example
7.

G. BMP Vectors Containing the Adenosine Deaminase (Ada) Marker

BMP genes were inserted into the vector
pMT3SV2Ada [R. J. Kaufman, Meth. Enz., 185:537-566
(1990)] to yield expression plasmids containing separate
transcription units for the BMP cDNA gene and the
selectable marker Ada. pMT3SV2Ada contains a polylinker
with recognition sites for the enzymes PstI, EcoRI, SalI
and XbaI that can be used for insertion of and expression
of genes (i.e. BMP) in mammalian cells. In addition, the
vector contains a second transcription unit encoding Ada
which serves as a dominant and amplifiable marker in
mammalian cells.

To construct expression vectors for BMP-5, BMP-6
and BMP-7, individually, the same general method was
employed. The gene for BMP-5 (Fig. 5), 6 (Fig. 4) or 7
(Fig. 3) was inserted into the polylinker essentially as
described above for the pED4 vector. These vectors can
be used for transfection into CHO DUKX cells and
subsequent selection and amplification using the Ada
marker as previously described [Kaufman et al, Proc.
Natl. Acad. Sci. USA, 83:3136-3140 (1986)]. Since each
such vector does not contain a DHFR gene, the resultant
transformed cells remain DHFR negative and can be
subsequently transfected with a second vector containing
a different BMP in conjunction with DHFR and amplified
with methotrexate.

Alternatively, the pMT3SV2Ada/BMP vectors can
be used to transfect stable CHO cell lines previously
transfected with a different BMP gene and amplified using
the DHFR/methotrexate system. The resultant
transfectants can be subsequently amplified using the Ada
system, yielding cell lines that coexpress two different
BMP genes, and are amplified using both the DHFR and Ada
markers.

H. BMP-Expressing Mammalian Cell Lines

At present, the most desirable mammalian
cell lines for use in producing the recombinant
homodimers and heterodimers of this invention are the
following. These cell lines were prepared by
conventional transformation of CHO cells using vectors
described above.

The BMP-2 expressing cell line 2EG5 is a
CHO cell stably transformed with the vector
pBMP2delta-EMC.

The BMP-4 expressing cell line 4E9 is a
CHO cell stably transformed with the vector
pBMP4delta-EMC.

The BMP-5 expressing cell line 5E10 is a
CHO cell stably transformed with the vector
BMP5mix-EMC-11 (at an amplification level of 2 micromolar
MTX).

The BMP-6 expressing cell line 6HG8 is a
CHO cell stably transformed with the vector BMP6/EMC.

The BMP-7 expressing cell line 7MB9 is a
CHO cell stably transformed with the vector BMP7/pMT21.

EXAMPLE 2 - TRANSIENT EXPRESSION OF BMP HETERODIMERS
The heterodimers of the present invention may
be prepared by co-expression in a transient expression
system for screening in the assays of Example 8 by two
different techniques as follows.

In the first procedure, the pMT2-derived and
EMC-derived expression plasmids described in Example 1
and other similarly derived vectors were constructed
which encoded, individually, BMP-2 through BMP-7, and
transforming growth factor-beta (TGF-β1). All
combinations of pairs of plasmids were mixed in equal
proportion and used to co-transfect CHO cells using the
DEAE-dextran procedure [Sompayrac and Danna, Proc. Natl.
Acad. Sci. USA, 78:7575-7578 (1981); Luthman and
Magnusson, Nucl. Acids Res., 11:1295-1308 (1983)]. The
cells are grown in alpha Minimal Essential Medium (α-MEM)
supplemented with 10% fetal bovine serum, adenosine,
deoxyadenosine, thymidine (100 µg/ml each), pen/strep,
and glutamine (1 mM).

The addition of compounds such as heparin,
suramin and dextran sulfate is desirable in growth
medium to increase the amounts of BMP-2 present in the
conditioned medium of CHO cells. Similarly responsive to
such compounds is BMP-5. Therefore, it is expected that
these compounds will be added to growth medium for any
heterodimer containing these BMP components. Other BMPs
may also be responsive to the effects of these compounds,
which are believed to inhibit the interaction of the
mature BMP molecules with the cell surface.

The following day, fresh growth medium, with or
without 100 µg/ml heparin, was added. Twenty-four hours
later, conditioned medium was harvested.

In some experiments, the conditioned medium was
collected minus heparin for the 24-48 hour period
post-transfection, and the same plates were then used to
generate conditioned medium in the presence of heparin
48-72 hours post-transfection. Controls included
transfecting cells with expression plasmids lacking any
BMP sequences, transfecting cells with plasmids
containing sequences for only a single BMP, or mixing
conditioned medium from cells transfected with a single
BMP with conditioned medium from cells transfected with a
different BMP.

Characterizations of the coexpressed
heterodimer BMPs in crude conditioned media, which is
otherwise not purified, provided the following results.
Transiently coexpressed BMP was assayed for induction of
alkaline phosphatase activity on W20 stromal cells, as
described in Example 8.

Co-expression of BMP-2 with BMP-5, BMP-6 and
BMP-7, and BMP-4 with BMP-5, BMP-6 and BMP-7 yielded more
alkaline phosphatase inducing activity in the W20 assay
than either of the individual BMP homodimers alone or
mixtures of homodimers, as shown below. Maximal activity
(in vitro) was obtained when BMP-2 was coexpressed with
BMP-7. Increased activity was also found with the
heterodimers BMP-2/5; BMP-2/6; BMP-4/5; BMP-4/6; and
BMP-4/7.



Conditioned Medium
TGF-β BMP-7 BMP-6 BMP-5 BMP-4 BMP-3 BMP-2
BMP-2 33 240 99 89 53 9 29
BMP-3 - - - 14
BMP-4 12 115 25 22 24
BMP-5 -
BMP-6 -
BMP-7 -
TGF-β -

Conditioned Medium + heparin
TGF-β BMP-7 BMP-6 BMP-5 BMP-4 BMP-3 BMP-2
BMP-2 88 454 132 127 70 77 169
BMP-3 7
BMP-4 7 119 30 41 37
BMP-5 -
BMP-6 -
BMP-7 -
TGF-β -



Units: 1 unit of activity is equivalent to that of 1 ng/ml of rhBMP-2. 
-: indicates activity below the detection limit of the assay. 
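
To make the comparison with the BMP-2 homodimer explicit, the BMP-2 row of each table can be expressed as a fold change over BMP-2 expressed alone. This is an illustrative sketch, not part of the original data analysis. It assumes the seven values in each BMP-2 row map left to right onto the seven column headings, and the function name `fold_over_homodimer` is hypothetical.

```python
# Values transcribed from the BMP-2 rows above, assuming the seven numbers
# map left to right onto the seven column headings (an assumption).
COLUMNS = ["TGF-beta", "BMP-7", "BMP-6", "BMP-5", "BMP-4", "BMP-3", "BMP-2"]
BMP2_ROW = {
    "conditioned medium": [33, 240, 99, 89, 53, 9, 29],
    "conditioned medium + heparin": [88, 454, 132, 127, 70, 77, 169],
}


def fold_over_homodimer(values, columns=COLUMNS, homodimer="BMP-2"):
    """Divide each co-expression value by the value for the homodimer alone."""
    by_partner = dict(zip(columns, values))
    baseline = by_partner[homodimer]
    return {
        partner: units / baseline
        for partner, units in by_partner.items()
        if partner != homodimer
    }


for condition, values in BMP2_ROW.items():
    folds = fold_over_homodimer(values)
    best = max(folds, key=folds.get)
    print(f"{condition}: highest with {best} "
          f"({folds[best]:.1f}-fold over BMP-2 alone)")
```

Read this way, co-expression with BMP-7 gives the largest increase over BMP-2 alone in both conditions, in line with the statement that maximal in vitro activity was obtained with the BMP-2/BMP-7 combination.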



These BMP combinations were subsequently expressed
using various ratios of expression plasmids (9:1, 3:1,
1:1, 1:3, 1:9) during the CHO cell transient
transfection. The performance of this method using
plasmids containing BMP-2 and plasmids containing BMP-7
at plasmid number ratios ranging from 9:1 to 1:9,
respectively, demonstrated that the highest activity in
the W20 assay was obtained when approximately the same
number of plasmids of each BMP were transfected into the
host cell. Ratios of BMP-2 to BMP-7 plasmids of 3:1 to
1:3, respectively, also resulted in increased activity in
the W20 assay in comparison to host cells transfected with
plasmids containing only a single BMP. However, these
latter ratios produced less activity than the 1:1 ratio.
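
For planning such a titration, each ratio can be converted into the amount of each plasmid in a fixed total. The sketch below is only an illustration. The total of 10 micrograms is a hypothetical figure that does not come from the text, and splitting by mass matches the plasmid number ratio only if the two plasmids are of similar size, which is assumed here.

```python
# Illustrative only. TOTAL_DNA_UG is a hypothetical amount, not from the text.
RATIOS = [(9, 1), (3, 1), (1, 1), (1, 3), (1, 9)]  # BMP-2 : BMP-7 plasmid ratios
TOTAL_DNA_UG = 10.0


def split_by_ratio(total: float, first: int, second: int):
    """Split a total amount into two parts in the ratio first:second."""
    parts = first + second
    return total * first / parts, total * second / parts


for bmp2_part, bmp7_part in RATIOS:
    bmp2_ug, bmp7_ug = split_by_ratio(TOTAL_DNA_UG, bmp2_part, bmp7_part)
    print(f"{bmp2_part}:{bmp7_part} -> BMP-2 plasmid {bmp2_ug:.2f} ug, "
          f"BMP-7 plasmid {bmp7_ug:.2f} ug")
```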

Similar ratios may be determined by one of
skill in the art for heterodimers consisting of other
than BMP-2 and BMP-7. For example, preliminary work on
the heterodimer formed between BMP-2 and BMP-6 has
indicated that a preferred ratio of plasmids for
co-transfection is 3:1, respectively. The determination of
preferred ratios for this method is within the skill of
the art.

As an alternative means to transiently generate
coexpressed BMPs, the stable CHO cell lines identified in
Example 1 expressing each of BMP-2, BMP-4, BMP-5, BMP-6
and BMP-7 are cocultured for one day, and are then fused
with 46.7% polyethylene glycol (PEG). One day
post-fusion, fresh medium is added and the heterodimers are
harvested 24 hours later for the W20 assay, described in
Example 8. The assay results were substantially similar
to those described immediately above.

Therefore, all combinations of BMP-2 or 4
coexpressed with either BMP-5, 6 or 7 yielded greater
activity than any of the BMP homodimers alone. In
control experiments where each BMP homodimer was
expressed alone and conditioned media mixed post harvest,
the activity was always intermediate between the
individual BMPs, demonstrating that the BMP co-expressed
heterodimers yield higher activity than combinations of
the individually expressed BMP homodimers.

EXAMPLE 3 - STABLE EXPRESSION OF BMP HETERODIMERS 
A. BMP-2/7

Based on the results of the transient assays in
Example 2, stable cell lines were made that co-express
BMP-2 and BMP-7.

A preferred stable cell line, 2E7E-10, was
obtained as follows: Plasmid DNA (a 1:1 mixture of
pBMP-7-EMC and pBMP-2-EMC, described in Example 1) is
transfected into CHO cells by electroporation [Neumann et
al, EMBO J., 1:841-845 (1982)].

Two days later, cells are switched to selective
medium containing 10% dialyzed fetal bovine serum and
lacking nucleosides. Colonies expressing DHFR are
counted 10-14 days later. Individual colonies or pools
of colonies are expanded and analyzed for expression of
each heterodimer BMP component RNA and protein using
standard procedures and are subsequently selected for
amplification by growth in increasing concentrations of
MTX. Stepwise selection of the preferred clone, termed
2E7E, is carried out up to a concentration of 0.5 µM MTX.
The cell line is then subcloned and assayed for
heterodimer 2/7 expression.

Procedures for such assay include Western blot 
analysis to detect the presence of the component DNA, 
protein analysis and SDS-PAGE analysis of metabolically 
labelled protein, W20 assay, and analysis for cartilage 
and/or bone formation activity using the ectopic rat bone 
formation assay of Example 9. The presently preferred 
clonally-derived cell line is identified as 2E7E-10. 
This cell line secretes BMP-2/7 heterodimer proteins into 
the media containing 0.5 µM MTX.

The CHO cell line 2E7E-10 is grown in 
Dulbecco's modified Eagle's medium (DMEM) /Ham's nutrient 
mixture F-12, 1:1 (vol/vol), supplemented with 10% fetal 
bovine serum. When the cells are 80 to 100% confluent, 
the medium is replaced with serum-free DMEM/F-12. Medium 
is harvested every 24 hours for 4 days. For protein 
production and purification the cells are cultured serum- 
free. 

While the co-expressing cell line 2E7E-10 
preliminarily appears to make lower amounts of BMP 
protein than the BMP2-expressing cell line 2EG5 described 
in Example 2, preliminary evidence suggests that the 
specific activity of the presumptive heterodimer is at
least 5-fold greater than BMP-2 homodimer (see Example
6). 

To construct another heterodimer producing cell
line, the stable CHO cell line 7MB9, previously
transfected with pBMP-7-pMT2, and which expresses BMP-7,
is employed. 7MB9 may be amplified and selected to 2 µM
methotrexate resistance using the DHFR/MTX system. To
generate a stable co-expressing cell line, cell line 7MB9
is transfected with the expression vector pBMP-2Δ-EN
(EMC-Neo) containing BMP-2 and the neomycin resistance
gene from the Tn5 transposable element. The resulting
transfected stable cell line was selected for both G-418
and MTX resistance. Individual clones were picked and
analyzed for BMP expression, as described above.

It is anticipated that stable cell lines
co-expressing other combinations of BMPs which show enhanced
activity by transient coexpression will likewise yield
greater activity upon stable expression.

B. BMP-2/6 

Based on the results of the transient assays in
Example 2, stable cell lines were made that co-express
BMP-2 and BMP-6.

A preferred stable cell line, 12C07, was
obtained as follows: Plasmid DNA (a 1:3 mixture of
pBMP-6-EMC and pBMP-2-EMC, described in Example 1) is
transfected into CHO cells by electroporation [Neumann et
al, EMBO J., 1:841-845 (1982)].

Two days later, cells are switched to selective
medium containing 10% dialyzed fetal bovine serum and
lacking nucleosides. Colonies expressing DHFR are
counted 10-14 days later. Individual colonies or pools
of colonies are expanded and analyzed for expression of
each heterodimer BMP component RNA and protein using
standard procedures and are subsequently selected for
amplification by growth in increasing concentrations of
MTX. Stepwise selection of the preferred clone, termed
12-C, is carried out up to a concentration of 2.0 µM MTX.
The cell line is then subcloned and assayed for
heterodimer 2/6 expression.

Procedures for such assay include Western blot 
analysis to detect the presence of the component DNA, 
protein analysis and SDS-PAGE analysis of metabolically 
labelled protein, W20 assay, and analysis for cartilage 
and/or bone formation activity using the ectopic rat bone 
formation assay of Example 9. The presently preferred 
clonally-derived cell line is identified as 12C07. This 
cell line secretes BMP-2/6 heterodimer proteins into the 
media containing 2.0 µM MTX.

The CHO cell line 12C07 is grown in Dulbecco's 
modified Eagle's medium (DMEM) /Ham's nutrient mixture F- 
12, 1:1 (vol/vol) , supplemented with 10% fetal bovine 
serum. When the cells are 80 to 100% confluent, the 
medium is replaced with serum-free DMEM/F-12. Medium is 
harvested every 24 hours for 4 days. For protein 
production and purification the cells are cultured serum- 
free. 

While the co-expressing cell line 12C07
preliminarily appears to make lower amounts of BMP
protein than the BMP2-expressing cell line 2EG5 described
in Example 2, preliminary evidence suggests that the
specific activity of the presumptive heterodimer is at
least 3-5-fold greater than BMP-2 homodimer (see Example
6).

To construct another heterodimer producing cell
line, the stable CHO cell line 2EG5, previously
transfected with pBMP-2-EMC, and which expresses BMP-2,
is employed. 2EG5 may be amplified and selected to 2 µM
methotrexate resistance using the DHFR/MTX system. To
generate a stable co-expressing cell line, cell line 2EG5
is transfected with the expression vector pBMP-6-ada (ada
deaminase) containing BMP-6 and the ADA resistance gene.
The resulting transfected stable cell line was selected
for both DCF and MTX resistance. Individual clones are
picked and analyzed for BMP expression, as described
above.

It is anticipated that stable cell lines
co-expressing other combinations of BMPs which show enhanced
activity by transient coexpression will likewise yield
greater activity upon stable expression.

EXAMPLE 4 - PURIFICATION OF BMP-2/7 AND BMP-2/6 HETERODIMER
The same purification procedure is used for BMP-2/6
heterodimer and BMP-2/7 heterodimer. Conditioned media
from cultures of cell line 2E7E-10 or 12C07 containing
recombinantly produced BMP heterodimer 2/7 or 2/6,
respectively, can be generated from either adherent or
suspension cultures. For small to medium scale
generation of coexpressed BMP, adherent cultures are
seeded into roller bottles and allowed to grow to
confluence in alpha-Minimal Eagles Medium [α-MEM, Gibco,
Grand Island, NY] containing 10% dialyzed
heat-inactivated fetal calf serum [Hazleton, Denver, PA]. The
media is then switched to a serum-free, albumin free, low
protein medium based on a 50:50 mixture of Dulbecco's
Modified Eagle's medium and Ham's F-12 medium, optionally
supplemented with 100 micrograms/ml dextran sulfate.
Four or five daily harvests are pooled, and used to
purify the recombinant protein.

Conditioned medium from roller bottle cultures
obtained as described above was thawed slowly at room
temperature and pooled. The pH of the pooled medium was
adjusted to pH 8.0 using 1 M Tris, pH 8.0. A column was
poured containing Matrex Cellufine Sulfate [Amicon] and
equilibrated in 50 mM Tris, pH 8.0.

Upon completion of loading of the medium, the
column was washed with buffer containing 50 mM Tris, 0.4
M NaCl, pH 8.0 until the absorbance at 280 nm reached
baseline. The column was then washed with 50 mM Tris, pH
8.0 to remove NaCl from the buffer. The resin was then
washed with 50 mM Tris, 0.2 M NaCl, 4 M Urea, pH 8.0
until a peak had eluted. The column was then washed into
50 mM Tris, pH 8.0 to remove the urea.

The bound BMP-2/7 or BMP-2/6 was then eluted
using 50 mM Tris, 0.5 M NaCl, 0.5 M Arginine, pH 8.0.
The eluate was collected as a single pool and may be
optionally stored frozen prior to further purification.
This Cellufine Sulfate eluate was diluted with 14 volumes
of 6M urea and the pH of the sample was then adjusted to
6.0. A hydroxyapatite-Ultrogel [IBF] column was poured
and equilibrated with 80 mM potassium phosphate, 6M urea,
pH 6.0.
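
The effect of the 14-volume dilution on the composition of the sample can be worked out with a simple mixing calculation. The sketch below is illustrative and rests on assumptions that are not stated in the text: the 6 M urea diluent contains no Tris, NaCl or arginine, volumes are additive, and the later pH adjustment is ignored. The function name is hypothetical.

```python
# Illustrative mixing calculation; assumptions are listed in the lead-in.
def diluted_concentration(sample_conc, sample_volumes=1.0,
                          diluent_volumes=14.0, diluent_conc=0.0):
    """Concentration after mixing sample and diluent (additive volumes)."""
    total_volumes = sample_volumes + diluent_volumes
    amount = sample_conc * sample_volumes + diluent_conc * diluent_volumes
    return amount / total_volumes


# Cellufine Sulfate elution buffer components from the text.
eluate = {"Tris (mM)": 50.0, "NaCl (M)": 0.5, "arginine (M)": 0.5}

after_dilution = {name: diluted_concentration(conc) for name, conc in eluate.items()}
# Urea is absent from the eluate and present at 6 M in the diluent.
after_dilution["urea (M)"] = diluted_concentration(0.0, diluent_conc=6.0)

for name, conc in after_dilution.items():
    print(f"{name}: {conc:.3f}")
```

Under these assumptions the dilution lowers the NaCl and arginine concentrations fifteen-fold while bringing urea close to the 6 M of the hydroxyapatite equilibration buffer.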

After the completion of sample loading, the
column was washed with 10 bed volumes of the
equilibration buffer. Bound BMP-2/7 or BMP-2/6
heterodimers were eluted with 5 bed volumes of 100 mM
potassium phosphate, 6M urea, pH 7.4. This eluate was
loaded directly onto a Vydac C4 reverse-phase HPLC column
equilibrated in water - 0.1% TFA. BMP-2/7 or BMP-2/6
heterodimers were eluted with a gradient of 30-50%
acetonitrile in water - 0.1% trifluoroacetic acid.

Fractions containing BMPs are identified by SDS-PAGE
in the presence or absence of reductant. The identity of
the BMPs with respect to the heterodimers vs. homodimers
is determined by 2D-PAGE (+/- reductant). Fractions with
heterodimers gave bands which reduce to two spots. Bands
from homodimer fractions reduce to a single spot for each
BMP species.
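
The decision rule applied to the two-dimensional gel can be stated compactly: a non-reduced dimer band that resolves into one subunit species after reduction is a homodimer, and one that resolves into two is a heterodimer. The following is a minimal sketch of that rule. The function name and the subunit labels passed to it are hypothetical inputs, not gel data from the text.

```python
# Illustrative decision rule only; inputs are hypothetical labels.
def classify_band(reduced_spots):
    """Classify a non-reduced dimer band from the subunit spots seen after reduction."""
    species = set(reduced_spots)
    if len(species) == 1:
        return "homodimer of " + next(iter(species))
    if len(species) == 2:
        return "heterodimer of " + "/".join(sorted(species))
    raise ValueError("expected one or two subunit species in a dimer band")


print(classify_band(["BMP-2"]))
print(classify_band(["BMP-7", "BMP-2"]))
```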

The BMP-2/6 heterodimer subunits are analyzed
on a protein sequenator. BMP-2/6 heterodimers of the
following species are present: BMP-6 subunit beginning
with amino acid #375 Ser-Ala-Ser-Ser in association with
BMP-2 subunit beginning with amino acid #283 Gln-Ala-Lys
or #249 Ser-Leu-His, though other less abundant species
may be present.

It is contemplated that the same or substantially similar
purification techniques may be employed for any
recombinant BMP heterodimer of this invention. The
hydroxyapatite-Ultrogel column may be unnecessary, and
the purification scheme may be modified by loading
the Cellufine Sulfate eluate directly onto the C4
reverse-phase HPLC column without use of the former column for
BMP-2/7 or BMP-2/6 or the other heterodimers of this
invention.



EXAMPLE 5 - PROTEIN CHARACTERIZATION

Total protein secreted from the co-expressing
cell lines is analyzed after labelling with 35S-methionine
or by Western blot analysis using antibodies raised
against both BMPs of the heterodimer, e.g., BMP-2 and
BMP-7. Together with the alkaline phosphatase assays,
the data indicate the presence of the heterodimer and
the specific activity. The following specific details
are directed towards data collected for the BMP-2/7 and
BMP-2/6 heterodimers; however, by application of similar
methods to the other heterodimers described herein,
similar results are expected.

A. 35S-Met labelling

Cell lines derived by cotransfection of
BMP2Δ-EMC and BMP7Δ-EMC expression vectors were pulsed
with 35S-methionine for 15 minutes, and chased for 6 hours
in serum free media in the presence or absence of
heparin. Total secreted protein was analyzed under
reducing conditions by PAGE and fluorography. The
results demonstrate that several cell lines secrete both
BMP-2 and BMP-7 protein. There is a good correlation
between the amount of alkaline phosphatase activity and
the amount of coexpressed protein.



Several cell lines secrete less total BMP-2
and 7 than the BMP-2-only expressing cell line 2EG5,
which produces 10 µg/ml BMP-2. Cell line 2E7E-10
(amplified at a level of 0.5 µM MTX) secretes equal
proportions of BMP-2 and BMP-7 at about the same overall
level of expression as the cell line 2EG5. Cell line
2E7E-10 produces the equivalent of 600 micrograms/ml of
BMP-2 homodimer activity in one assay.

Total labelled protein was also analyzed on a
two-dimensional non-reducing/reducing gel system to
ascertain whether a heterodimer is made. Preliminary
results demonstrate the presence of a unique spot in this
gel system that is not found in either the BMP-2-only or
BMP-7-only cell lines, suggesting the presence of 2/7
heterodimer. The same gel with purified material
produced the same results (e.g., two unique spots on the
gel) indicative of the presence of the 2/7 heterodimer.
The homodimer of BMP2 produced distinct species on this
gel system.

In contrast to the recombinant BMP-2/7 purification,
BMP-2 homodimers are not detected during the BMP-2/6
preparation; however, significant amounts of BMP-6
homodimers are found. In addition, a significant amount
of a ~20 amino acid N-terminal truncated form of BMP-6 is
found; this could be eliminated by the inclusion of
protease inhibitors during cell culture. BMP-2/6 was
found to elute two to three fractions later from C4
RP-HPLC than did BMP-2/7.

Amino acid sequencing indicates that the predominant
BMP-2/7 heterodimer species comprises a mature BMP-2
subunit [amino acid #283(Gln)-#396(Arg)] and a mature
subunit of BMP-7 [#293(Ser)-#431(His)]. BMP-2/6
heterodimer comprises the mature BMP-2 subunit (#283-396)
and the mature BMP-6 subunit [#375(Ser)-#513(His)].



B. Immunoprecipitation coupled to Western blot analysis

Conditioned media from a BMP-2-only
(2EG5), a BMP-7-only (7MB9), or the 2E7E-10 co-expressing
cell line were subjected to immunoprecipitation with
either a BMP-2 or BMP-7 antibody (both conventional
polyclonal antibodies raised in rabbits), then analyzed
on Western blots probed with either an anti-BMP-2 or
anti-BMP-7 antibody. The 2/7 heterodimer precipitates
and is reactive on Western blots with both the BMP-2 and
BMP-7 antibodies, while either BMP by itself reacts with
its specific antibody, but not with the reciprocal
antibody.

It has been demonstrated using this 
strategy that a protein in the co-expressing cell line 

10 that is precipitated by the anti-BMP-7 antibody W33 

[Genetics Institute, Inc, Cambridge, Massachusetts] and 
reacts on a Western blot with the anti-BMP-2 antibody W12 
or W10 [Genetics Institute, Inc.] is not present in the 
BMP-2 or 7-only expressing cell lines. This experiment 

15 indicates that this protein species is the heterodimeric 
protein. Conversely, precipitation with W12 and probing 
with W33 yielded similar results. 

EXAMPLE 6 - SPECIFIC ACTIVITY OF HETERODIMERS
A. In vitro Assays
The specific activities of the BMP-2/7 or BMP-2/6
heterodimer and the BMP-2 homodimer secreted into growth
medium of the stable cell lines 2E7E-10 and 2EG5, and
12C07 and 2EG5, respectively, were estimated as follows.

The amount of BMP protein in conditioned medium
was measured by either Western blot analysis or by
analyzing protein secreted from 35S-methionine labelled
cells by PAGE and fluorography. The amount of activity
produced by the same cell lines on W20 cells using either
the alkaline phosphatase assay or osteocalcin-induction
assay was then estimated. The specific activity of the
BMP was calculated from the ratio of activity to protein
secreted into the growth medium.

In one experiment 2E7E-10 and 2EG5 secreted
similar amounts of total BMP proteins as determined by
PAGE and fluorography. 2E7E-10 produced about 50-fold
more alkaline phosphatase inducing activity than 2EG5,
suggesting that the specific activity of the heterodimer
is about 50-fold higher than the homodimer.

In another experiment the amount of BMP-2
secreted by 2EG5 was about 50% higher than BMP-2/7
secreted by 2E7E-10; however, 2E7E-10 produced about
10-fold more osteocalcin-inducing activity than 2EG5. From
several different experiments of this type the specific
activity of the BMP-2/7 heterodimer is estimated to be
between 5 and 50 fold higher than the BMP-2 homodimer.
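
The ratio of activity to secreted protein described above can be
illustrated numerically. The following Python fragment is only a
sketch: the function name is hypothetical, all quantities are
expressed relative to the 2EG5 (homodimer) line, and the inputs are
the approximate figures quoted for the two experiments.

```python
def fold_specific_activity(activity_ratio, protein_ratio):
    """Fold difference in specific activity, heterodimer vs. homodimer.

    activity_ratio: heterodimer activity / homodimer activity
    protein_ratio:  heterodimer protein / homodimer protein
    (both are relative values; this normalization is an assumption
    made for the illustration)
    """
    if protein_ratio <= 0:
        raise ValueError("protein_ratio must be positive")
    return activity_ratio / protein_ratio


# First experiment: similar protein secreted, about 50-fold more activity.
exp1 = fold_specific_activity(50.0, 1.0)

# Second experiment: 2EG5 secreted about 50% more protein than 2E7E-10,
# so heterodimer/homodimer protein is about 1/1.5; activity about 10-fold.
exp2 = fold_specific_activity(10.0, 1.0 / 1.5)

print(round(exp1, 1), round(exp2, 1))
```

Under these assumptions both experiments fall inside the 5 to 50
fold range stated in the text.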

Figures 8 and 9 compare the activity of BMP-2
and BMP-2/7 in the W-20 alkaline phosphatase and BGP (Bone
Gla Protein, osteocalcin) assays. BMP-2/7 has greatly
increased specific activity relative to BMP-2 (Figure 8).
From Figure 8, approximately 1.3 ng/ml of BMP-2/7 was
sufficient to induce 50% of the maximal alkaline
phosphatase response in W-20 cells. A comparable value
for BMP-2 is difficult to calculate, since the alkaline
phosphatase response did not reach a maximum, but greater
than 30 ng/ml is needed for a half-maximal response.
BMP-2/7 thus has a 20 to 30-fold higher specific activity
than BMP-2 in the W-20 assay.
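
The fold difference quoted above follows from the ratio of the two
half-maximal doses. A minimal sketch, assuming the half-maximal dose
is used as the measure of potency (the function name is
hypothetical); because the BMP-2 value is only known to exceed
30 ng/ml, the result is a lower bound:

```python
def fold_potency(half_max_reference, half_max_test):
    """Ratio of half-maximal doses (reference / test), both in ng/ml."""
    if half_max_test <= 0:
        raise ValueError("half_max_test must be positive")
    return half_max_reference / half_max_test


# BMP-2 needs more than 30 ng/ml; BMP-2/7 needs about 1.3 ng/ml.
lower_bound = fold_potency(30.0, 1.3)
print(f"BMP-2/7 is at least {lower_bound:.0f}-fold more potent")
```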

As seen in Figure 9, BMP-2/7 was also a more
effective stimulator of BGP (bone gla protein,
osteocalcin) production than BMP-2 in this experiment.
Treating W-20-17 cells with BMP-2/7 for four days
resulted in a maximal BGP response with 62 ng/ml, and 11
ng/ml elicited 50% of the maximal BGP response. In
contrast, maximal stimulation of BGP synthesis by BMP-2
was not seen with doses up to 468 ng/ml of protein. The
minimal dose of BMP-2/7 needed to elicit a BGP response
by W-20-17 cells was 3.9 ng/ml, about seven-fold less
than the 29 ng/ml required of BMP-2. These results were
consistent with the data obtained in the W-20-17 alkaline
phosphatase assays for BMP-2 and BMP-2/7.

Preliminary analysis indicates that BMP-2/6 has
a specific activity in vitro similar to that of BMP-2/7.
The potencies of BMP-2 and BMP-2/6 on induction of
alkaline phosphatase production in W-20 cells are
compared in Figure 12; BMP-2/6 has a higher specific
activity than BMP-2 in this assay system. These data are
in good agreement with data obtained from the in vivo
assay of BMP-2 and BMP-2/6.

B. In Vivo Assay
(i) BMP-2/7

The purified BMP-2/7 and BMP-2 were tested in
the rat ectopic bone formation assay. A series of
different amounts of BMP-2/7 or BMP-2 were implanted in
triplicate in rats. After 5 and 10 days, the implants
were removed and examined histologically for the presence
of bone and cartilage. The histological scores for the
amounts of new cartilage and bone formed are summarized
in Table A.

Table A

5 Day implants 10 Day Implants 



BMP-2/7 BMP-2 BMP-2/7 BMP-2 



0.04 C ± - ± ± - ± 

B ± - ± 

0.02 C ± 1 ± 212 - ± ± 

B 1 ± 1 - ± - 

1.0 C 1 ± ± ±±± 222 1 li 

B 233 1 1 ± 

5.0 C 2 2 1 1 ± 1 112 12 1 

B ±-1 4 4 3 232 

25.0 C ± ± 2 2 2 2 

B 4 4 3 3 3 3 



The amount of BMP-2/7 required to induce cartilage and
bone in the rat ectopic assay is lower than that of
BMP-2. Histologically, the appearance of cartilage and
bone induced by BMP-2/7 and BMP-2 is identical.
(ii) BMP-2/6

The in vivo activity of BMP-2/6 was compared with
that of BMP-2 by implantation of various amounts of each
BMP for ten days in the rat ectopic bone formation assay.
The results of this study (Table B, Figure 13) indicate
that BMP-2/6, similar to BMP-2/7, has increased in vivo
activity relative to BMP-2. The specific activities of
BMP-2, BMP-6, and BMP-2/6 are compared in the ectopic
bone formation assay ten days after the proteins are
implanted. The results of these experiments are shown in
Table C and Figure 14. BMP-2/6 is a more potent inducer
of bone formation than either BMP-2 or BMP-6. The amount
of bone formation observed with BMP-2/6 was comparable
to that observed with equivalent doses of BMP-2/7. The
appearance of BMP-2/6 implants is quite similar to
implants containing BMP-2 or BMP-2/7.

Table B

Histological scores of implants of BMP-2/6 and BMP-2 in rat ectopic
assay (10 day implants).

BMP (µg)   C/B   BMP-2/6   BMP-2

0.04       C     - ± -
           B

0.20       C     1 1 ±
           B     ± ± ±

1.0        C     1 3 3     1 1 ±
           B     1 2 2     1 1 ±

5.0        C     2 2 2     1 2 2
           B     2 3 3     2 2 2

25.        C     1 1 1     2 2 1
           B     3 3 3     3 3 3


Table C

Histological scores of implants of BMP-2, BMP-6, and BMP-2/6 in rat
ectopic assay (10 day implants).

BMP (µg)   C/B   BMP-2     BMP-6     BMP-2/6

0.04       C                         - - ±
           B                         - - ±

0.20       C     - - 2               1 2 2
           B     - - 1               2 2 2

1.0        C     - ± ±     2 1 1     1 1 1
           B     - ± ±     1 ± ±     3 3 2

5.0        C     2 2 1     3 1 3     ± ± 1
           B     1 1 1     2 ± 1     4 5 4

25.        C     ± ± ±     ± ± ±     ± ± ±
           B     5 4 5     4 4 5     4 5 3



EXAMPLE 7 - EXPRESSION OF BMP DIMER IN E. COLI

A biologically active, homodimeric BMP-2 was
expressed in E. coli using the techniques described in
European Patent Application 433,255 with minor
modifications. Other methods disclosed in the
above-referenced European patent application may also be
employed to produce heterodimers of the present invention
from E. coli. Application of these methods to the
heterodimers of this invention is anticipated to produce
active BMP heterodimeric proteins from E. coli.
A. BMP-2 Expression Vector

An expression plasmid pALBP2-781 (Figure 7)
(SEQ ID NO: 13) was constructed containing the mature
portion of the BMP-2 (SEQ ID NO: 14) gene and other
sequences which are described in detail below. This
plasmid directed the accumulation of 5-10% of the total
cell protein as BMP-2 in an E. coli host strain, GI724,
described below.

Plasmid pALBP2-781 contains the following
principal features. Nucleotides 1-2060 contain DNA
sequences originating from the plasmid pUC-18 [Norrander
et al, Gene, 26:101-106 (1983)] including sequences
containing the gene for β-lactamase which confers
resistance to the antibiotic ampicillin in host E. coli
strains, and a colE1-derived origin of replication.
Nucleotides 2061-2221 contain DNA sequences for the major
leftward promoter (pL) of bacteriophage λ [Sanger et al,
J. Mol. Biol., 162:729-773 (1982)], including three
operator sequences, OL1, OL2 and OL3. The operators are
the binding sites for λcI repressor protein,
intracellular levels of which control the amount of
transcription initiation from pL. Nucleotides 2222-2723
contain a strong ribosome binding sequence included on a
sequence derived from nucleotides 35566 to 35472 and
38137 to 38361 from bacteriophage lambda as described in
Sanger et al, J. Mol. Biol., 162:729-773 (1982).
Nucleotides 2724-3133 contain a DNA sequence encoding
mature BMP-2 protein with an additional 62 nucleotides of
3'-untranslated sequence.

Nucleotides 3134-3149 provide a "Linker" DNA
sequence containing restriction endonuclease sites.
Nucleotides 3150-3218 provide a transcription termination
sequence based on that of the E. coli aspA gene [Takagi
et al, Nucl. Acids Res., 13:2063-2074 (1985)].
Nucleotides 3219-3623 are DNA sequences derived from
pUC-18.
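
The nucleotide ranges listed above tile the plasmid without gaps. The
following Python sketch records them as a simple feature table and
checks that tiling; the labels are paraphrases of the description,
and the function and variable names are hypothetical, introduced
only for this illustration.

```python
# (first nucleotide, last nucleotide, description), 1-based and inclusive
FEATURES = [
    (1, 2060, "pUC-18 sequences: beta-lactamase gene, colE1 origin"),
    (2061, 2221, "lambda pL promoter with operators OL1-OL3"),
    (2222, 2723, "ribosome binding sequence (lambda-derived)"),
    (2724, 3133, "mature BMP-2 coding sequence + 62 nt 3'-untranslated"),
    (3134, 3149, "linker with restriction endonuclease sites"),
    (3150, 3218, "aspA-based transcription terminator"),
    (3219, 3623, "pUC-18 sequences"),
]


def check_contiguous(features):
    """Return the total length if the segments leave no gap or overlap."""
    expected_start = 1
    for start, end, _label in features:
        if start != expected_start or end < start:
            raise ValueError(f"gap or overlap at nucleotide {start}")
        expected_start = end + 1
    return expected_start - 1


def feature_at(features, position):
    """Return the description of the segment containing a position."""
    for start, end, label in features:
        if start <= position <= end:
            return label
    raise ValueError(f"position {position} is outside the plasmid")


total_length = check_contiguous(FEATURES)
bmp2_segment_length = 3133 - 2724 + 1
print(total_length, bmp2_segment_length, feature_at(FEATURES, 2100))
```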

As described below, when cultured under
the appropriate conditions in a suitable E. coli host
strain, pALBP2-781 can direct the production of high
levels (approximately 10% of the total cellular protein)
of BMP-2 protein.

pALBP2-781 was transformed into the E. coli
host strain GI724 (F-, lacIq, lacPL8, ampC::λcI+) by the
procedure of Dagert and Ehrlich, Gene, 6:23 (1979). [The
untransformed host strain E. coli GI724 was deposited
with the American Type Culture Collection, 12301 Parklawn
Drive, Rockville, Maryland on January 31, 1991 under ATCC
No. 55151 for patent purposes pursuant to applicable laws
and regulations.] Transformants were selected on 1.5%
w/v agar plates containing IMC medium, which is composed
of M9 medium [Miller, "Experiments in Molecular
Genetics", Cold Spring Harbor Laboratory, New York
(1972)] supplemented with 0.5% w/v glucose, 0.2% w/v
casamino acids and 100 µg/ml ampicillin.

GI724 contains a copy of the wild-type λcI
repressor gene stably integrated into the chromosome at
the ampC locus, where it has been placed under the
transcriptional control of Salmonella typhimurium trp
promoter/operator sequences. In GI724, λcI protein is
made only during growth in tryptophan-free media, such as
minimal media or a minimal medium supplemented with
casamino acids such as IMC, described above. Addition of
tryptophan to a culture of GI724 will repress the trp
promoter and turn off synthesis of λcI, gradually causing
the induction of transcription from pL promoters if they
are present in the cell.
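
The induction logic described above can be summarized as a small
truth table. The Python sketch below is a deliberately simplified,
steady-state model (the text notes that induction is gradual); the
function and parameter names are hypothetical.

```python
def pl_transcription_active(tryptophan_present, pl_plasmid_present=True):
    """Simplified steady-state model of pL control in strain GI724.

    Tryptophan represses the trp promoter; in GI724 the cI repressor
    gene is driven by that promoter, and cI in turn blocks pL.
    """
    trp_promoter_on = not tryptophan_present
    ci_repressor_made = trp_promoter_on
    return pl_plasmid_present and not ci_repressor_made


for trp in (False, True):
    state = "on" if pl_transcription_active(trp) else "off"
    print(f"tryptophan present: {trp} -> pL transcription {state}")
```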

GI724 transformed with pALBP2-781 was
grown at 37°C to an A550 of 0.5 (absorbance at 550 nm) in
IMC medium. Tryptophan was added to a final
concentration of 100 ng/ml and the culture incubated for
a further 4 hours. During this time BMP-2 protein
accumulated to approximately 10% of the total cell
protein, all in the "inclusion body" fraction.

BMP-2 is recovered in a non-soluble,
monomeric form as follows. Cell disruption and recovery
is performed at 4°C. Approximately 9 g of the wet
fermented E. coli GI724/pALBP2-781 cells are suspended in
30 mL of 0.1 M Tris/HCl, 10 mM EDTA, 1 mM phenyl methyl
sulphonyl fluoride (PMSF), pH 8.3 (disruption buffer).
The cells are passed four times through a cell disrupter
and the volume is brought to 100 mL with the disruption
buffer. The suspension is centrifuged for 20 min.
(15,000 x g). The pellet obtained is suspended in 50 mL
disruption buffer containing 1 M NaCl and centrifuged for
10 min. as above. The pellet is suspended in 50 mL
disruption buffer containing 1% Triton X-100 (Pierce) and
again centrifuged for 10 min. as above. The washed
pellet is then suspended in 25 mL of 20 mM Tris/HCl, 1 mM
EDTA, 1 mM PMSF, 1% DTT, pH 8.3 and homogenized in a
glass homogenizer. The resulting suspension contains
crude monomeric BMP-2 in a non-soluble form.

Ten mL of the BMP-2 suspension, obtained
as described above, are acidified with 10% acetic acid to
pH 2.5 and centrifuged in an Eppendorf centrifuge for 10
min. at room temperature. The supernatant is
chromatographed on a Sephacryl S-100 HR column
(Pharmacia, 2.6 x 83 cm) in 1% acetic acid at a flow rate
of 1.4 mL/minute. Fractions containing monomeric BMP-2
are pooled. This material is used to generate
biologically active, homodimeric BMP-2.

Biologically active, homodimeric BMP-2 can
be generated from the monomeric BMP-2 obtained following
solubilization and purification, described above, as
follows.

0.1, 0.5 or 2.5 mg of the BMP-2 is
dissolved at a concentration of 20, 100 or 500 µg/mL,
respectively, in 50 mM Tris/HCl, pH 8.0, 1 M NaCl, 5 mM
EDTA, 2 mM reduced glutathione, 1 mM oxidized glutathione
and 33 mM CHAPS [Calbiochem]. After 4 days at 4°C or
23°C, the mixture is diluted 5 to 10 fold with 0.1% TFA.
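
As a quick arithmetic check of the amounts above, each pairing of
protein mass and concentration corresponds to the same refolding
volume. The Python sketch below is illustrative only; the function
name is hypothetical, and the diluted volumes simply apply the
stated 5 to 10 fold dilution to that computed volume.

```python
def refolding_volume_ml(protein_mg, conc_ug_per_ml):
    """Volume (mL) needed to dissolve protein_mg at conc_ug_per_ml."""
    if conc_ug_per_ml <= 0:
        raise ValueError("concentration must be positive")
    return protein_mg * 1000.0 / conc_ug_per_ml


# (mg of BMP-2, target concentration in ug/mL), paired as in the text
pairs = [(0.1, 20.0), (0.5, 100.0), (2.5, 500.0)]
volumes = [refolding_volume_ml(mg, conc) for mg, conc in pairs]

# Total volume range after a 5- to 10-fold dilution with 0.1% TFA
diluted = [(5 * v, 10 * v) for v in volumes]
print(volumes, diluted[0])
```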
Purification of biologically active BMP-2
is achieved by subjecting the diluted mixture to reverse
phase HPLC on a Vydac C4 214TP54 column (25 x 0.46 cm)
[The NEST Group, USA] at a flow rate of 1 ml/minute.
Buffer A is 0.1% TFA. Buffer B is 90% acetonitrile and
0.1% TFA. The linear gradient was 0 to 5 minutes at 20%
Buffer B; 5 to 10 minutes at 20 to 30% Buffer B; 10 to
40 minutes at 30 to 60% Buffer B; and 40 to 50 minutes at
60 to 100% Buffer B. Homodimeric BMP-2 is eluted and
collected from the HPLC column.
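
The gradient program above is piecewise linear, so the Buffer B
percentage at any time can be found by interpolation between the
stated breakpoints. The Python sketch below illustrates this; the
names are hypothetical, and the acetonitrile estimate simply assumes
that Buffer B is 90% acetonitrile and Buffer A contains none.

```python
# (time in minutes, % Buffer B) breakpoints taken from the description
GRADIENT = [(0, 20), (5, 20), (10, 30), (40, 60), (50, 100)]


def percent_b(t_min):
    """Linearly interpolate % Buffer B at time t_min (minutes)."""
    if not GRADIENT[0][0] <= t_min <= GRADIENT[-1][0]:
        raise ValueError("time is outside the gradient program")
    for (t0, b0), (t1, b1) in zip(GRADIENT, GRADIENT[1:]):
        if t0 <= t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)


def percent_acetonitrile(t_min):
    """Approximate % acetonitrile in the mobile phase at time t_min."""
    return 0.90 * percent_b(t_min)


for t in (0, 7.5, 25, 50):
    print(t, percent_b(t), percent_acetonitrile(t))
```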
The HPLC fractions are lyophilized to
dryness, redissolved in sample buffer (1.5 M Tris-HCl, pH
8.45, 12% glycerol, 4% SDS, 0.0075% Serva Blue G, 0.0025%
Phenol Red, with or without 100 mM dithiothreitol) and
heated for five minutes at 95°C. The running buffer is
100 mM Tris, 100 mM tricine (16% tricine gel) [Novex],
0.1% SDS at pH 8.3. The SDS-PAGE gel is run at 125 volts
for 2.5 hours.

The gel is stained for one hour with 200
ml of 0.5% Coomassie Brilliant Blue R-250, 25%
isopropanol, 10% acetic acid, heated to 60°C. The gel is
then destained with 10% acetic acid, 10% isopropanol
until the background is clear.

The reduced material ran at approximately
13 kD; the non-reduced material ran at approximately 30
kD, which is indicative of the BMP-2 dimer. This
material was later shown to be active in the W-20 assay
of Example 8.
B. BMP-7 Expression Vector

For high level expression of BMP-7 a
plasmid pALBMP7-981 was constructed. pALBMP7-981 is
identical to plasmid pALBP2-781 with two exceptions: the
BMP-2 gene (residues 2724-3133 of pALBP2-781) is replaced
by the mature portion of the BMP-7 gene, deleted for
sequences encoding the first seven residues of the mature
BMP-7 protein sequence:
ATGTCTCATAATC GTTCTAAAAC TCCAAAAAAT CAAGAAGCTC TGCGTATGGC



CAACGTGGCA GAGAACAGCA GCAGCGACCA GAGGCAGGCC TGTAAGAAGC
ACGAGCTGTA TGTCAGCTTC CGAGACCTGG GCTGGCAGGA CTGGATCATC
GCGCCTGAAG GCTACGCCGC CTACTACTGT GAGGGGGAGT GTGCCTTCCC
TCTGAACTCC TACATGAACG CCACCAACCA CGCCATCGTG CAGACGCTGG
TCCACTTCAT CAACCCGGAA ACGGTGCCCA AGCCCTGCTG TGCGCCCACG
CAGCTCAATG CCATCTCCGT CCTCTACTTC GATGACAGCT CCAACGTCAT
CCTGAAGAAA TACAGAAACA TGGTGGTCCG GGCCTGTGGC TGCCACTAGC
TCCTCCGAGA ATTCAGACCC TTTGGGGCCA AGTTTTTCTG GATCCT



and the ribosome binding site found between residues
2707 and 2723 in pALBP2-781 is replaced by a different
ribosome binding site, based on that found preceding the
T7 phage gene 10, of sequence 5'-CAAGAAGGAGATATACAT-3'.
The host strain and growth conditions used for the
production of BMP-7 were as described for BMP-2.
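
The reading frame of the BMP-7 insert shown above can be inspected
by translating it with the standard genetic code. The Python sketch
below is an illustration, not part of the described procedure: the
helper name is hypothetical, and only the first 33 nucleotides of
the listed sequence are used as input.

```python
# Standard genetic code, codons enumerated in T, C, A, G order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string (spaces allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the sequence as printed in the text
start_of_insert = "ATGTCTCATAATC GTTCTAAAAC TCCAAAAAAT"
peptide = translate(start_of_insert)
print(peptide)
```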

C. BMP-3 Expression Vector

For high level expression of BMP-3 a
plasmid pALB3-782 was constructed. This plasmid is
identical to plasmid pALBP2-781, except that the BMP-2
gene (residues 2724-3133 of pALBP2-781) is replaced by a
gene encoding a form of mature BMP-3. The sequence of
this BMP-3 gene is:
ATGCGTAAAC AATGGATTGA ACCACGTAAC TGTGCTCGTC GTTATCTGAA
AGTAGACTTT GCAGATATTG GCTGGAGTGA ATGGATTATC TCCCCCAAGT
CCTTTGATGC CTATTATTGC TCTGGAGCAT GCCAGTTCCC CATGCCAAAG
TCTTTGAAGC CATCAAATCA TGCTACCATC CAGAGTATAG TGAGAGCTGT
GGGGGTCGTT CCTGGGATTC CTGAGCCTTG CTGTGTACCA GAAAAGATGT
CCTCACTCAG TATTTTATTC TTTGATGAAA ATAAGAATGT AGTGCTTAAA
GTATACCCTA ACATGACAGT AGAGTCTTGC GCTTGCAGAT AACCTGGCAA
AGAACTCATT TGAATGCTTA ATTCAAT

The host strain and growth conditions used for the
production of BMP-3 were as described for BMP-2.

D. Expression of a BMP-2/7 Heterodimer in E. coli

Denatured and purified E. coli BMP-2 and BMP-7
monomers were isolated from E. coli inclusion body
pellets by acidification and gel filtration as
described above. 125 µg of each BMP in 1%
acetic acid were mixed and taken to dryness in a speed
vac. The material was resuspended in 2.5 ml 50 mM Tris,
1.0 M NaCl, 5 mM EDTA, 33 mM CHAPS, 2 mM glutathione
(reduced), 1 mM glutathione (oxidized), pH 8.0. The
sample was incubated at 23°C for one week.
The BMP-2/7 heterodimer was isolated by
HPLC on a 25 x 0.46 cm Vydac C4 column. The sample was
centrifuged in a microfuge for 5 minutes, and the
supernatant was diluted with 22.5 ml 0.1% TFA.

A buffer: 0.1% TFA
B buffer: 0.1% TFA, 95% acetonitrile
1.0 ml/minute

0-5'       20% B
5-10'      20-30% B
10-90'     30-50% B
90-100'    50-100% B

By SDS-PAGE analysis, the BMP-2/7 heterodimer eluted at
about 23'.

Figure 10 is a comparison of the W-20 activity of E.
coli BMP-2 and BMP-2/7 heterodimer, indicating greater
activity of the heterodimer.

F. Expression of BMP-2/3 Heterodimer in E. coli

BMP-2 and BMP-3 monomers were isolated as
follows: to 1.0 g of frozen harvested cells expressing
either BMP-2 or BMP-3 was added 3.3 ml of 100 mM Tris, 10
mM EDTA, pH 8.3. The cells were resuspended by vortexing
vigorously. 33 µl of 100 mM PMSF in isopropanol was
added and the cells lysed by one pass through a French
pressure cell. The lysate was centrifuged in a microfuge
for 20 minutes at 4°C. The supernatant was discarded.
The inclusion body pellet was taken up in 8.0 M guanidine
hydrochloride, 0.25 M DTT, 0.5 M Tris, 5 mM EDTA, pH 8.5,
and heated at 37°C for one hour.

The reduced and denatured BMP monomers were isolated
by HPLC on a Supelco C4 guard column as follows:

A buffer: 0.1% TFA
B buffer: 0.1% TFA, 95% acetonitrile
1.0 ml/minute

0-5'       1% B
5-40'      1-70% B
40-45'     70-100% B

Monomeric BMP eluted at 28-30'. Protein concentration
was estimated by A280 and the appropriate extinction
coefficient.

10 µg of BMP-2 and BMP-3 were combined and taken to
dryness in a speed vac. To this was added 50 µl of 50
mM Tris, 1.0 M NaCl, 5 mM EDTA, 33 mM CHAPS, 2 mM reduced
glutathione, 1 mM oxidized glutathione, pH 8.5. The
sample was incubated at 23°C for 3 days. The sample was
analyzed by SDS-PAGE on a 16% tricine gel under reducing
and nonreducing conditions. The BMP-2/3 heterodimer
migrated at about 35 kD nonreduced, and reduced to BMP-2
monomer at about 13 kD and BMP-3 monomer at about 21 kD.

BMP-2/3 heterodimer produced in E. coli is
tested for in vivo activity. A dose of 20 µg, implanted
for ten days, is used to compare the in vivo activity of
BMP-2/3 to BMP-2. BMP-2/3 implants showed no cartilage or
bone forming activity, while the BMP-2 control implants
showed the predicted amounts of bone and cartilage
formation. The in vivo data obtained with BMP-2/3 are
consistent with the in vitro data from the W-20 assay.

EXAMPLE 8 - W-20 BIOASSAYS

A. Description of W-20 cells

Use of the W-20 bone marrow stromal cells
as an indicator cell line is based upon the conversion of
these cells to osteoblast-like cells after treatment with
BMP-2 [R. S. Thies et al, "Bone Morphogenetic Protein
alters W-20 stromal cell differentiation in vitro",
Journal of Bone and Mineral Research, 5(2):305 (1990);
and R. S. Thies et al, "Recombinant Human Bone
Morphogenetic Protein 2 Induces Osteoblastic
Differentiation in W-20-17 Stromal Cells", Endocrinology,
in press (1992)]. Specifically, W-20 cells are a clonal
bone marrow stromal cell line derived from adult mice by
researchers in the laboratory of Dr. D. Nathan,
Children's Hospital, Boston, MA. BMP-2 treatment of W-20
cells results in (1) increased alkaline phosphatase
production, (2) induction of PTH stimulated cAMP, and (3)
induction of osteocalcin synthesis by the cells. While
(1) and (2) represent characteristics associated with the
osteoblast phenotype, the ability to synthesize
osteocalcin is a phenotypic property only displayed by
mature osteoblasts. Furthermore, to date we have
observed conversion of W-20 stromal cells to
osteoblast-like cells only upon treatment with BMPs. In
this manner, the in vitro activities displayed by BMP
treated W-20 cells correlate with the in vivo bone
forming activity known for BMPs.

Below, two in vitro assays useful in comparison
of BMP activities of novel osteoinductive molecules are
described.

B. W-20 Alkaline Phosphatase Assay Protocol

W-20 cells are plated into 96 well tissue
culture plates at a density of 10,000 cells per well in
200 µl of media (DME with 10% heat inactivated fetal calf
serum, 2 mM glutamine and 100 U/ml penicillin + 100 µg/ml
streptomycin). The cells are allowed to attach overnight
in a 95% air, 5% CO2 incubator at 37°C.

The 200 µl of media is removed from each
well with a multichannel pipettor and replaced with an
equal volume of test sample delivered in DME with 10%
heat inactivated fetal calf serum, 2 mM glutamine and 1%
penicillin-streptomycin. Test substances are assayed in
triplicate.

The test samples and standards are allowed
a 24 hour incubation period with the W-20 indicator
cells. After the 24 hours, plates are removed from the
37°C incubator and the test media are removed from the
cells.

The W-20 cell layers are washed 3 times
with 200 µl per well of calcium/magnesium free phosphate
buffered saline and these washes are discarded.

50 µl of glass distilled water is added to
each well and the assay plates are then placed on a dry
ice/ethanol bath for quick freezing. Once frozen, the
assay plates are removed from the dry ice/ethanol bath
and thawed at 37°C. This step is repeated 2 more times
for a total of 3 freeze-thaw procedures. Once complete,
the membrane bound alkaline phosphatase is available for
measurement.

50 µl of assay mix (50 mM glycine, 0.05%
Triton X-100, 4 mM MgCl2, 5 mM p-nitrophenol phosphate,
pH 10.3) is added to each assay well and the assay plates
are then incubated for 30 minutes at 37°C in a shaking
waterbath at 60 oscillations per minute.

At the end of the 30 minute incubation,
the reaction is stopped by adding 100 µl of 0.2 N NaOH to
each well and placing the assay plates on ice.

The spectrophotometric absorbance for each
well is read at a wavelength of 405 nanometers. These
values are then compared to known standards to give an
estimate of the alkaline phosphatase activity in each
sample. For example, using known amounts of
p-nitrophenol phosphate, absorbance values are generated.
This is shown in Table I.



Table I

Absorbance Values for Known Standards
of P-Nitrophenol Phosphate

P-nitrophenol phosphate (µmoles)    Mean absorbance (405 nm)

0.000                               0
0.006                               0.261 +/- .024
0.012                               0.521 +/- .031
0.018                               0.797 +/- .063
0.024                               1.074 +/- .061
0.030                               1.305 +/- .083
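
The standards in Table I are close to linear, so an absorbance
reading can be converted to µmoles with a single slope. The Python
sketch below fits a least-squares line through the origin to the
mean values; forcing the line through the origin and the function
names are assumptions made for this illustration, and the reported
standard deviations are ignored.

```python
# (umoles of standard, mean absorbance at 405 nm) from Table I
STANDARDS = [
    (0.000, 0.000),
    (0.006, 0.261),
    (0.012, 0.521),
    (0.018, 0.797),
    (0.024, 1.074),
    (0.030, 1.305),
]


def slope_through_origin(points):
    """Least-squares slope of y = m * x (absorbance units per umole)."""
    sum_xy = sum(x * y for x, y in points)
    sum_xx = sum(x * x for x, _y in points)
    return sum_xy / sum_xx


def absorbance_to_umoles(absorbance, slope):
    """Convert an absorbance reading to umoles using the fitted slope."""
    return absorbance / slope


slope = slope_through_origin(STANDARDS)
print(f"slope = {slope:.1f} absorbance units per umole")
print(f"A405 of 0.650 corresponds to {absorbance_to_umoles(0.650, slope):.4f} umoles")
```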



Absorbance values for known amounts of
BMP-2 can be determined and converted to µmoles of
p-nitrophenol phosphate cleaved per unit time as shown in
Table II.

Table II

Alkaline Phosphatase Values for W-20 Cells
Treating with BMP-2

BMP-2 concentration    Absorbance Reading    µmoles substrate
ng/ml                  405 nmeters           per hour

0                      0.645                 0.024
1.56                   0.696                 0.026
3.13                   0.765                 0.029
6.25                   0.923                 0.036
12.50                  1.121
25.0                   1.457                 0.058
50.0                   1.662                 0.067
100.0                  1.977                 0.080


These values are then used to compare the
activities of known amounts of BMP heterodimers to BMP-2
homodimer.

C. Osteocalcin RIA Protocol

W-20 cells are plated at 10^6 cells per well
in 24 well multiwell tissue culture dishes in 2 ml of
DME containing 10% heat inactivated fetal calf serum, 2
mM glutamine. The cells are allowed to attach overnight
in an atmosphere of 95% air 5% CO2 at 37°C.

The next day the medium is changed to DME
containing 10% fetal calf serum, 2 mM glutamine and the
test substance in a total volume of 2 ml. Each test
substance is administered to triplicate wells. The test
substances are incubated with the W-20 cells for a total
of 96 hours with replacement at 48 hours by the same test
media.

At the end of 96 hours, 50 µl of the test
media is removed from each well and assayed for
osteocalcin production using a radioimmunoassay for mouse
osteocalcin. The details of the assay are described in
the kit manufactured by Biomedical Technologies Inc., 378
Page Street, Stoughton, MA 02072. Reagents for the
assay are found as product numbers BT-431 (mouse
osteocalcin standard), BT-432 (Goat anti-mouse
Osteocalcin), BT-431R (iodinated mouse osteocalcin),
BT-415 (normal goat serum) and BT-414 (donkey anti goat
IgG). The RIA for osteocalcin synthesized by W-20 cells
in response to BMP treatment is carried out as described
in the protocol provided by the manufacturer.

The values obtained for the test samples
are compared to values for known standards of mouse
osteocalcin and to the amount of osteocalcin produced by
W-20 cells in response to challenge with known amounts of
BMP-2. The values for BMP-2 induced osteocalcin
synthesis by W-20 cells are shown in Table III.

Table III

Osteocalcin Synthesis by W-20 Cells

BMP-2 concentration (ng/ml)    Osteocalcin Synthesis (ng/well)

0                              0.8
2                              0.9
4                              0.8
8                              2.2
16                             2.7
31                             3.2
62                             5.1
125                            6.5
250                            8.2
500                            9.4
1000                           10.0



EXAMPLE 9 - ROSEN-MODIFIED SAMPATH-REDDI ASSAY

A modified version of the rat bone formation
assay described in Sampath and Reddi, Proc. Natl. Acad.
Sci. USA, 80:6591-6595 (1983) is used to evaluate bone
and/or cartilage activity of BMP proteins. This modified
assay is herein called the Rosen-modified Sampath-Reddi
assay. The ethanol precipitation step of the
Sampath-Reddi procedure is replaced by dialyzing (if the
composition is a solution) or diafiltering (if the
composition is a suspension) the fraction to be assayed
against water. The solution or suspension is then
redissolved in 0.1% TFA, and the resulting solution added
to 20 mg of rat matrix. A mock rat matrix sample not
treated with the protein serves as a control. This
material is frozen and lyophilized and the resulting
powder enclosed in #5 gelatin capsules. The capsules are
implanted subcutaneously in the abdominal thoracic area
of 21-49 day old male Long Evans rats. The implants are
removed after 7-14 days. Half of each implant is used
for alkaline phosphatase analysis [see, A. H. Reddi et
al, Proc. Natl. Acad. Sci., 69:1601 (1972)].

The other half of each implant is fixed and
processed for histological analysis. 1 µm
glycolmethacrylate sections are stained with Von Kossa
and acid fuchsin to score the amount of induced bone and
cartilage formation present in each implant. The terms
+1 through +5 represent the area of each histological
section of an implant occupied by new bone and/or
cartilage cells and matrix. A score of +5 indicates that
greater than 50% of the implant is new bone and/or
cartilage produced as a direct result of protein in the
implant. A score of +4, +3, +2, and +1 would indicate
that greater than 40%, 30%, 20% and 10% respectively of
the implant contains new cartilage and/or bone.
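
The scoring thresholds above amount to a simple lookup. The Python
sketch below is an illustration with a hypothetical function name;
the text does not define a numeric score at or below 10% (the tables
use "±" and "-" for such implants), so returning 0 in that case is
an assumption.

```python
def histology_score(percent_new_tissue):
    """Map the percentage of a section occupied by new bone and/or
    cartilage to a score of +1 through +5 using the stated thresholds."""
    if not 0 <= percent_new_tissue <= 100:
        raise ValueError("percentage must be between 0 and 100")
    for threshold, score in ((50, 5), (40, 4), (30, 3), (20, 2), (10, 1)):
        if percent_new_tissue > threshold:
            return score
    return 0  # assumption: no numeric score is defined at or below 10%


for pct in (5, 15, 35, 45, 80):
    print(pct, histology_score(pct))
```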

The heterodimeric BMP proteins of this
invention may be assessed for activity on this assay.

Numerous modifications and variations in
practice of this invention are expected to occur to those
skilled in the art. Such modifications and variations
are encompassed within the following claims.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Israel, David

Wolfman, Neil M.

(ii) TITLE OF INVENTION: Recombinant Bone Morphogenetic Protein
Heterodimers, Compositions and Methods of Use.

(iii) NUMBER OF SEQUENCES: 30 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Legal Affairs, Genetics Institute, Inc. 

(B) STREET: 87 CambridgePark Drive 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY : USA 

(F) ZIP: 02140-2387 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Tape 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Kapinos, Ellen J. 

(B) REGISTRATION NUMBER: 32,245 

(C) REFERENCE/DOCKET NUMBER: GI-5192B 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-876-1170 

(B) TELEFAX: 617-876-5851 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 356..1543



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GTCGACTCTA GAGTGTGTGT CAGCACTTGG CTGGGGACTT CTTGAACTTG CAGGGAGAAT 60

AACTTGCGCA CCCCACTTTG CGCCGGTGCC TTTGCCCCAG CGGAGCCTGC TTCGCCATCT 120

CCGAGCCCCA CCGCCCCTCC ACTCCTCGGC CTTGCCCGAC ACTGAGACGC TGTTCCCAGC

                      GCCGGCACCC GGGAGAAGGA GGAGGCAAAG AAAAGGAACG 240

GACATTCGGT CCTTGCGCCA GGTCCTTTGA CCAGAGTTTT TCCATGTGGA CGCTCTTTCA 300

ATGGACGTGT CCCCGCGTGC TTCTTAGACG GACTGCGGTC TCCTAAAGGT CGACC ATG 358
                                                             Met
                                                             1

GTG GCC GGG ACC CGC TGT CTT CTA GCG TTG CTG CTT CCC CAG GTC CTC 406
Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gln Val Leu
5 10 15

CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG TTC 454
Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys Phe
20 25 30

GCG GCG GCG TCG TCG GGC CGC CCC TCA TCC CAG CCC TCT GAC GAG GTC 502
Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gln Pro Ser Asp Glu Val
35 40 45

CTG AGC GAG TTC GAG TTG CGG CTG CTC AGC ATG TTC GGC CTG AAA CAG 550
Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys Gln
50 55 60 65

AGA CCC ACC CCC AGC AGG GAC GCC GTG GTG CCC CCC TAC ATG CTA GAC 598
Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu Asp
70 75 80

CTG TAT CGC AGG CAC TCA GGT CAG CCG GGC TCA CCC GCC CCA GAC CAC 646
Leu Tyr Arg Arg His Ser Gly Gln Pro Gly Ser Pro Ala Pro Asp His
85 90 95

CGG TTG GAG AGG GCA GCC AGC CGA GCC AAC ACT GTG CGC AGC TTC CAC 694
Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe His
100 105 110

CAT GAA GAA TCT TTG GAA GAA CTA CCA GAA ACG AGT GGG AAA ACA ACC 742
His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr Thr
115 120 125

CGG AGA TTC TTC TTT AAT TTA AGT TCT ATC CCC ACG GAG GAG TTT ATC 790
Arg Arg Phe Phe Phe Asn Leu Ser Ser Ile Pro Thr Glu Glu Phe Ile
130 135 140 145

ACC TCA GCA GAG CTT CAG GTT TTC CGA GAA CAG ATG CAA GAT GCT TTA 838
Thr Ser Ala Glu Leu Gln Val Phe Arg Glu Gln Met Gln Asp Ala Leu
150 155 160

GGA AAC AAT AGC AGT TTC CAT CAC CGA ATT AAT ATT TAT GAA ATC ATA 886
Gly Asn Asn Ser Ser Phe His His Arg Ile Asn Ile Tyr Glu Ile Ile
165 170 175

AAA CCT GCA ACA GCC AAC TCG AAA TTC CCC GTG ACC AGA CTT TTG GAC 934
Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu Asp
180 185 190



ACC AGG TTG GTG AAT CAG AAT GCA AGC AGG TGG GAA ACT TTT GAT GTC 982
Thr Arg Leu Val Asn Gln Asn Ala Ser Arg Trp Glu Thr Phe Asp Val
195 200 205



ACC CCC GCT GTG ATG CGG TGG ACT GCA CAG GGA CAC GCC AAC CAT GGA 1030
Thr Pro Ala Val Met Arg Trp Thr Ala Gln Gly His Ala Asn His Gly
210 215 220 225

TTC GTG GTG GAA GTG GCC CAC TTG GAG GAG AAA CAA GGT GTC TCC AAG 1078
Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gln Gly Val Ser Lys
230 235 240

AGA CAT GTT AGG ATA AGC AGG TCT TTG CAC CAA GAT GAA CAC AGC TGG 1126
Arg His Val Arg Ile Ser Arg Ser Leu His Gln Asp Glu His Ser Trp
245 250 255

TCA CAG ATA AGG CCA TTG CTA GTA ACT TTT GGC CAT GAT GGA AAA GGG 1174
Ser Gln Ile Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys Gly
260 265 270

CAT CCT CTC CAC AAA AGA GAA AAA CGT CAA GCC AAA CAC AAA CAG CGG 1222
His Pro Leu His Lys Arg Glu Lys Arg Gln Ala Lys His Lys Gln Arg
275 280 285

AAA CGC CTT AAG TCC AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC 1270
Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe
290 295 300 305

AGT GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC 1318
Ser Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr His
310 315 320

GCC TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG 1366
Ala Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu
325 330 335

AAC TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC 1414
Asn Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn
340 345 350

TCT AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC 1462
Ser Lys Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile
355 360 365

TCG ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT 1510
Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr
370 375 380 385

CAG GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAGTACAGCA AAATTAAATA 1563
Gln Asp Met Val Val Glu Gly Cys Gly Cys Arg
390 395

CATAAATATA TATATATATA TATATTTTAG AAAAAAGAAA AAAA 1607

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 396 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gln Val
1 5 10 15

Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys
20 25 30

Phe Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gln Pro Ser Asp Glu
35 40 45

Val Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys
50 55 60

Gln Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu
65 70 75 80

Asp Leu Tyr Arg Arg His Ser Gly Gln Pro Gly Ser Pro Ala Pro Asp
85 90 95

His Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe
100 105 110

His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr
115 120 125

Thr Arg Arg Phe Phe Phe Asn Leu Ser Ser Ile Pro Thr Glu Glu Phe
130 135 140

Ile Thr Ser Ala Glu Leu Gln Val Phe Arg Glu Gln Met Gln Asp Ala
145 150 155 160

Leu Gly Asn Asn Ser Ser Phe His His Arg Ile Asn Ile Tyr Glu Ile
165 170 175

Ile Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu
180 185 190

Asp Thr Arg Leu Val Asn Gln Asn Ala Ser Arg Trp Glu Thr Phe Asp
195 200 205

Val Thr Pro Ala Val Met Arg Trp Thr Ala Gln Gly His Ala Asn His
210 215 220

Gly Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gln Gly Val Ser
225 230 235 240

Lys Arg His Val Arg Ile Ser Arg Ser Leu His Gln Asp Glu His Ser
245 250 255

Trp Ser Gln Ile Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys
260 265 270

Gly His Pro Leu His Lys Arg Glu Lys Arg Gln Ala Lys His Lys Gln
275 280 285

Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp
290 295 300

Phe Ser Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr
305 310 315 320

His Ala Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His
325 330 335

Leu Asn Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val
340 345 350

Asn Ser Lys Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala
355 360 365

Ile Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn
370 375 380

Tyr Gln Asp Met Val Val Glu Gly Cys Gly Cys Arg
385 390 395
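
The amino acid line printed beneath each codon row of SEQ ID NO:1 can be regenerated from the triplets using the standard genetic code. The following Python sketch is offered only as a cross-check aid and forms no part of the listing. It assumes the standard (nuclear) genetic code, and the names CODON_TABLE, THREE_LETTER and translate are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter codes, matching the style used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate a DNA string (spaces allowed) until the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        residues.append(THREE_LETTER[aa])
    return " ".join(residues)


if __name__ == "__main__":
    # First codons of the SEQ ID NO:1 coding sequence (positions 356 onward).
    print(translate("ATG GTG GCC GGG ACC CGC TGT CTT CTA"))
```

Applied row by row, a helper of this kind makes a mismatch between a codon and the residue printed under it easy to spot.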

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 403..1626



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTCTAGAGGG CAGAGGAGGA GGGAGGGAGG GAAGGAGCGC GGAGCCCGGC CCGGAAGCTA 60 

GGTGAGTGTG GCATCCGAGC TGAGGGACGC GAGCCTGAGA CGCCGCTGCT GCTCCGGCTG 120 

AGTATCTAGC TTGTCTCCCC GATGGGATTC CCGTCCAAGC TATCTCGAGC CTGCAGCGCC 180 

ACAGTCCCCG GCCCTCGCCC AGGTTCACTG CAACCGTTCA GAGGTCCCCA GGAGCTGCTG 240 

CTGGCGAGCC CGCTACTGCA GGGACCTATG GAGCCATTCC GTAGTGCCAT CCCGAGCAAC 300 

GCACTGCTGC AGCTTCCCTG AGCCTTTCCA GCAAGTTTGT TCAAGATTGG CTGTCAAGAA 360

TCATGGACTG TTATTATATG CCTTGTTTTC TGTCAAGACA CC ATG ATT CCT GGT 414

Met Ile Pro Gly
1 

AAC CGA ATG CTG ATG GTC GTT TTA TTA TGC CAA GTC CTG CTA GGA GGC 462
Asn Arg Met Leu Met Val Val Leu Leu Cys Gln Val Leu Leu Gly Gly
5 10 15 20

GCG AGC CAT GCT AGT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC 510
Ala Ser His Ala Ser Leu Ile Pro Glu Thr Gly Lys Lys Lys Val Ala
25 30 35

GAG ATT CAG GGC CAC GCG GGA GGA CGC CGC TCA GGG CAG AGC CAT GAG 558
Glu Ile Gln Gly His Ala Gly Gly Arg Arg Ser Gly Gln Ser His Glu
40 45 50

CTC CTG CGG GAC TTC GAG GCG ACA CTT CTG CAG ATG TTT GGG CTG CGC 606
Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gln Met Phe Gly Leu Arg
55 60 65

CGC CGC CCG CAG CCT AGC AAG AGT GCC GTC ATT CCG GAC TAC ATG CGG 654
Arg Arg Pro Gln Pro Ser Lys Ser Ala Val Ile Pro Asp Tyr Met Arg
70 75 80

GAT CTT TAC CGG CTT CAG TCT GGG GAG GAG GAG GAA GAG CAG ATC CAC 702
Asp Leu Tyr Arg Leu Gln Ser Gly Glu Glu Glu Glu Glu Gln Ile His
85 90 95 100

AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC AGC CGG GCC AAC ACC 750
Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser Arg Ala Asn Thr
105 110 115

GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC CCA GGG ACC 798
Val Arg Ser Phe His His Glu Glu His Leu Glu Asn Ile Pro Gly Thr
120 125 130

AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC CCT 846
Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser Ile Pro
135 140 145

GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG 894
Glu Asn Glu Val Ile Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gln
150 155 160

GTG GAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT 942
Val Asp Gln Gly Pro Asp Trp Glu Arg Gly Phe His Arg Ile Asn Ile
165 170 175 180

TAT GAG GTT ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC 990
Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro Gly His Leu Ile
185 190 195

ACA CGA CTA CTG GAC ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG 1038
Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn Val Thr Arg Trp
200 205 210

GAA ACT TTT GAT GTG AGC CCT GCG GTC CTT CGC TGG ACC CGG GAG AAG 1086
Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp Thr Arg Glu Lys
215 220 225

CAG CCA AAC TAT GGG CTA GCC ATT GAG GTG ACT CAC CTC CAT CAG ACT 1134
Gln Pro Asn Tyr Gly Leu Ala Ile Glu Val Thr His Leu His Gln Thr
230 235 240

CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC CGA TCG TTA CCT CAA 1182
Arg Thr His Gln Gly Gln His Val Arg Ile Ser Arg Ser Leu Pro Gln
245 250 255 260



WO 93/09229 



PCT/US92/09430 



96 

GGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC CTG GTC ACC TTT GGC 1230
Gly Ser Gly Asn Trp Ala Gln Leu Arg Pro Leu Leu Val Thr Phe Gly
265 270 275

CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG CGT 1278
His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys Arg
280 285 290

AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC 1326
Ser Pro Lys His His Ser Gln Arg Ala Arg Lys Lys Asn Lys Asn Cys
295 300 305

CGG CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC 1374
Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp
310 315 320

TGG ATT GTG GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC 1422
Trp Ile Val Ala Pro Pro Gly Tyr Gln Ala Phe Tyr Cys His Gly Asp
325 330 335 340

TGC CCC TTT CCA CTG GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT 1470
Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala Ile
345 350 355

GTG CAG ACC CTG GTC AAT TCT GTC AAT TCC AGT ATC CCC AAA GCC TGT 1518
Val Gln Thr Leu Val Asn Ser Val Asn Ser Ser Ile Pro Lys Ala Cys
360 365 370

TGT GTG CCC ACT GAA CTG AGT GCC ATC TCC ATG CTG TAC CTG GAT GAG 1566
Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu Asp Glu
375 380 385

TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG ATG GTA GTA GAG GGA 1614
Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln Glu Met Val Val Glu Gly
390 395 400

TGT GGG TGC CGC TGAGATCAGG CAGTCCTTGA GGATAGACAG ATATACACAC 1666
Cys Gly Cys Arg
405

CACACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 1726

ACAGACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA 1786

ATCCCTAAAC ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT 1846

TGATCATATA TTTTGACAAA ATATATTTAT AACTACGTAT TAAAAGAAAA AAATAAAATG 1906

AGTCATTATT TTAAAAAAAA AAAAAAAACT CTAGAGTCGA CGGAATTC 1954



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Ile Pro Gly Asn Arg Met Leu Met Val Val Leu Leu Cys Gln Val
1 5 10 15

Leu Leu Gly Gly Ala Ser His Ala Ser Leu Ile Pro Glu Thr Gly Lys
20 25 30

Lys Lys Val Ala Glu Ile Gln Gly His Ala Gly Gly Arg Arg Ser Gly
35 40 45

Gln Ser His Glu Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gln Met
50 55 60

Phe Gly Leu Arg Arg Arg Pro Gln Pro Ser Lys Ser Ala Val Ile Pro
65 70 75 80

Asp Tyr Met Arg Asp Leu Tyr Arg Leu Gln Ser Gly Glu Glu Glu Glu
85 90 95

Glu Gln Ile His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser
100 105 110

Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn
115 120 125

Ile Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu
130 135 140

Ser Ser Ile Pro Glu Asn Glu Val Ile Ser Ser Ala Glu Leu Arg Leu
145 150 155 160

Phe Arg Glu Gln Val Asp Gln Gly Pro Asp Trp Glu Arg Gly Phe His
165 170 175

Arg Ile Asn Ile Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro
180 185 190

Gly His Leu Ile Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn
195 200 205

Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp
210 215 220

Thr Arg Glu Lys Gln Pro Asn Tyr Gly Leu Ala Ile Glu Val Thr His
225 230 235 240

Leu His Gln Thr Arg Thr His Gln Gly Gln His Val Arg Ile Ser Arg
245 250 255

Ser Leu Pro Gln Gly Ser Gly Asn Trp Ala Gln Leu Arg Pro Leu Leu
260 265 270

Val Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg
275 280 285

Arg Ala Lys Arg Ser Pro Lys His His Ser Gln Arg Ala Arg Lys Lys
290 295 300

Asn Lys Asn Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val
305 310 315 320

Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr Gln Ala Phe Tyr
325 330 335

Cys His Gly Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr
340 345 350

Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser Ser Ile
355 360 365

Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu
370 375 380

Tyr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln Glu Met
385 390 395 400

Val Val Glu Gly Cys Gly Cys Arg
405

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 97..1389

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GTGACCGAGC GGCGCGGACG GCCGCCTGCC CCCTCTGCCA CCTGGGGCGG TGCGGGCCCG 60

GAGCCCGGAG CCCGGGTAGC GCGTAGAGCC GGCGCG ATG CAC GTG CGC TCA CTG 114
Met His Val Arg Ser Leu
1 5

CGA GCT GCG GCG CCG CAC AGC TTC GTG GCG CTC TGG GCA CCC CTG TTC 162
Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala Pro Leu Phe
10 15 20

CTG CTG CGC TCC GCC CTG GCC GAC TTC AGC CTG GAC AAC GAG GTG CAC 210
Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn Glu Val His
25 30 35

TCG AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG CGG GAG ATG 258
Ser Ser Phe Ile His Arg Arg Leu Arg Ser Gln Glu Arg Arg Glu Met
40 45 50

CAG CGC GAG ATC CTC TCC ATT TTG GGC TTG CCC CAC CGC CCG CGC CCG 306
Gln Arg Glu Ile Leu Ser Ile Leu Gly Leu Pro His Arg Pro Arg Pro
55 60 65 70




CAC CTC CAG GGC AAG CAC AAC TCG GCA CCC ATG TTC ATG CTG GAC CTG 354
His Leu Gln Gly Lys His Asn Ser Ala Pro Met Phe Met Leu Asp Leu
75 80 85

TAC AAC GCC ATG GCG GTG GAG GAG GGC GGC GGG CCC GGC GGC CAG GGC 402
Tyr Asn Ala Met Ala Val Glu Glu Gly Gly Gly Pro Gly Gly Gln Gly
90 95 100

TTC TCC TAC CCC TAC AAG GCC GTC TTC AGT ACC CAG GGC CCC CCT CTG 450
Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr Gln Gly Pro Pro Leu
105 110 115

GCC AGC CTG CAA GAT AGC CAT TTC CTC ACC GAC GCC GAC ATG GTC ATG 498
Ala Ser Leu Gln Asp Ser His Phe Leu Thr Asp Ala Asp Met Val Met
120 125 130

AGC TTC GTC AAC CTC GTG GAA CAT GAC AAG GAA TTC TTC CAC CCA CGC 546
Ser Phe Val Asn Leu Val Glu His Asp Lys Glu Phe Phe His Pro Arg
135 140 145 150

TAC CAC CAT CGA GAG TTC CGG TTT GAT CTT TCC AAG ATC CCA GAA GGG 594
Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys Ile Pro Glu Gly
155 160 165

GAA GCT GTC ACG GCA GCC GAA TTC CGG ATC TAC AAG GAC TAC ATC CGG 642
Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Asp Tyr Ile Arg
170 175 180

GAA CGC TTC GAC AAT GAG ACG TTC CGG ATC AGC GTT TAT CAG GTG CTC 690
Glu Arg Phe Asp Asn Glu Thr Phe Arg Ile Ser Val Tyr Gln Val Leu
185 190 195

CAG GAG CAC TTG GGC AGG GAA TCG GAT CTC TTC CTG CTC GAC AGC CGT 738
Gln Glu His Leu Gly Arg Glu Ser Asp Leu Phe Leu Leu Asp Ser Arg
200 205 210

ACC CTC TGG GCC TCG GAG GAG GGC TGG CTG GTG TTT GAC ATC ACA GCC 786
Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu Val Phe Asp Ile Thr Ala
215 220 225 230

ACC AGC AAC CAC TGG GTG GTC AAT CCG CGG CAC AAC CTG GGC CTG CAG 834
Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu Gly Leu Gln
235 240 245

CTC TCG GTG GAG ACG CTG GAT GGG CAG AGC ATC AAC CCC AAG TTG GCG 882
Leu Ser Val Glu Thr Leu Asp Gly Gln Ser Ile Asn Pro Lys Leu Ala
250 255 260

GGC CTG ATT GGG CGG CAC GGG CCC CAG AAC AAG CAG CCC TTC ATG GTG 930
Gly Leu Ile Gly Arg His Gly Pro Gln Asn Lys Gln Pro Phe Met Val
265 270 275

GCT TTC TTC AAG GCC ACG GAG GTC CAC TTC CGC AGC ATC CGG TCC ACG 978
Ala Phe Phe Lys Ala Thr Glu Val His Phe Arg Ser Ile Arg Ser Thr
280 285 290

GGG AGC AAA CAG CGC AGC CAG AAC CGC TCC AAG ACG CCC AAG AAC CAG 1026
Gly Ser Lys Gln Arg Ser Gln Asn Arg Ser Lys Thr Pro Lys Asn Gln
295 300 305 310

GAA GCC CTG CGG ATG GCC AAC GTG GCA GAG AAC AGC AGC AGC GAC CAG 1074
Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser Ser Asp Gln
315 320 325

AGG CAG GCC TGT AAG AAG CAC GAG CTG TAT GTC AGC TTC CGA GAC CTG 1122
Arg Gln Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu
330 335 340

GGC TGG CAG GAC TGG ATC ATC GCG CCT GAA GGC TAC GCC GCC TAC TAC 1170
Gly Trp Gln Asp Trp Ile Ile Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr
345 350 355

TGT GAG GGG GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG AAC GCC ACC 1218
Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala Thr
360 365 370

AAC CAC GCC ATC GTG CAG ACG CTG GTC CAC TTC ATC AAC CCG GAA ACG 1266
Asn His Ala Ile Val Gln Thr Leu Val His Phe Ile Asn Pro Glu Thr
375 380 385 390

GTG CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC ATC TCC GTC 1314
Val Pro Lys Pro Cys Cys Ala Pro Thr Gln Leu Asn Ala Ile Ser Val
395 400 405

CTC TAC TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA TAC AGA AAC 1362
Leu Tyr Phe Asp Asp Ser Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn
410 415 420

ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC GAGAATTCAG 1409
Met Val Val Arg Ala Cys Gly Cys His
425 430

ACCCTTTGGG GCCAAGTTTT TCTGGATCCT CCATTGCTC 1448

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 431 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala
1 5 10 15

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser
20 25 30

Leu Asp Asn Glu Val His Ser Ser Phe Ile His Arg Arg Leu Arg Ser
35 40 45

Gln Glu Arg Arg Glu Met Gln Arg Glu Ile Leu Ser Ile Leu Gly Leu
50 55 60

Pro His Arg Pro Arg Pro His Leu Gln Gly Lys His Asn Ser Ala Pro
65 70 75 80

Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly
85 90 95

Gly Pro Gly Gly Gln Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser
100 105 110

Thr Gln Gly Pro Pro Leu Ala Ser Leu Gln Asp Ser His Phe Leu Thr
115 120 125

Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys
130 135 140

Glu Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu
145 150 155 160

Ser Lys Ile Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg Ile
165 170 175

Tyr Lys Asp Tyr Ile Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg Ile
180 185 190

Ser Val Tyr Gln Val Leu Gln Glu His Leu Gly Arg Glu Ser Asp Leu
195 200 205

Phe Leu Leu Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu
210 215 220

Val Phe Asp Ile Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg
225 230 235 240

His Asn Leu Gly Leu Gln Leu Ser Val Glu Thr Leu Asp Gly Gln Ser
245 250 255

Ile Asn Pro Lys Leu Ala Gly Leu Ile Gly Arg His Gly Pro Gln Asn
260 265 270

Lys Gln Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe
275 280 285

Arg Ser Ile Arg Ser Thr Gly Ser Lys Gln Arg Ser Gln Asn Arg Ser
290 295 300

Lys Thr Pro Lys Asn Gln Glu Ala Leu Arg Met Ala Asn Val Ala Glu
305 310 315 320

Asn Ser Ser Ser Asp Gln Arg Gln Ala Cys Lys Lys His Glu Leu Tyr
325 330 335

Val Ser Phe Arg Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala Pro Glu
340 345 350

Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn
355 360 365

Ser Tyr Met Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu Val His
370 375 380

Phe Ile Asn Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gln
385 390 395 400

Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val Ile
405 410 415

Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His
420 425 430
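
The CDS LOCATION given for each nucleotide sequence above can be checked against the LENGTH stated for the corresponding protein sequence, since each coding interval should span a whole number of codons. The following Python sketch illustrates that arithmetic for SEQ ID NOS: 1, 3 and 5. The function names parse_location and codon_count are hypothetical helpers introduced here, and the sketch assumes a simple "start..end" location with no joins or complements.

```python
def parse_location(location):
    """Split a 'start..end' feature location into two 1-based integers."""
    start, end = location.replace(" ", "").split("..")
    return int(start), int(end)


def codon_count(location):
    """Number of complete codons spanned by an inclusive CDS location."""
    start, end = parse_location(location)
    length = end - start + 1  # both endpoints are included
    if length % 3 != 0:
        raise ValueError(f"{location} is not a whole number of codons")
    return length // 3


# CDS locations and protein lengths as stated in the listing above.
LISTED = {
    "SEQ ID NO:1": ("356..1543", 396),
    "SEQ ID NO:3": ("403..1626", 408),
    "SEQ ID NO:5": ("97..1389", 431),
}

if __name__ == "__main__":
    for name, (location, stated_length) in LISTED.items():
        print(name, location, codon_count(location), stated_length)
```

For these three entries the codon count equals the stated amino acid length, which indicates that the listed CDS intervals end on the last sense codon and do not include the stop codon.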

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2923 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens

(F) TISSUE TYPE: Human placenta 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Stratagene catalog #936203 Human placenta cDNA library

(B) CLONE: BMP6C35 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 160..1701

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1282..1698

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 1..2923 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGACCATGAG AGATAAGGAC TGAGGGCCAG GAAGGGGAAG CGAGCCCGCC GAGAGGTGGC 60 

GGGGACTGCT CACGCCAAGG GCCACAGCGG CCGCGCTCCG GCCTCGCTCC GCCGCTCCAC 120 

GCCTCGCGGG ATCCGCGGGG GCAGCCCGGC CGGGCGGGG ATG CCG GGG CTG GGG 174 

Met Pro Gly Leu Gly 
-374 -370 

CGG AGG GCG CAG TGG CTG TGC TGG TGG TGG GGG CTG CTG TGC AGC TGC 222
Arg Arg Ala Gln Trp Leu Cys Trp Trp Trp Gly Leu Leu Cys Ser Cys
-365 -360 -355

TGC GGG CCC CCG CCG CTG CGG CCG CCC TTG CCC GCT GCC GCG GCC GCC 270
Cys Gly Pro Pro Pro Leu Arg Pro Pro Leu Pro Ala Ala Ala Ala Ala
-350 -345 -340

GCC GCC GGG GGG CAG CTG CTG GGG GAC GGC GGG AGC CCC GGC CGC ACG 318
Ala Ala Gly Gly Gln Leu Leu Gly Asp Gly Gly Ser Pro Gly Arg Thr
-335 -330 -325

GAG CAG CCG CCG CCG TCG CCG CAG TCC TCC TCG GGC TTC CTG TAC CGG 366
Glu Gln Pro Pro Pro Ser Pro Gln Ser Ser Ser Gly Phe Leu Tyr Arg
-320 -315 -310

CGG CTC AAG ACG CAG GAG AAG CGG GAG ATG CAG AAG GAG ATC TTG TCG 414
Arg Leu Lys Thr Gln Glu Lys Arg Glu Met Gln Lys Glu Ile Leu Ser
-305 -300 -295 -290

GTG CTG GGG CTC CCG CAC CGG CCC CGG CCC CTG CAC GGC CTC CAA CAG 462
Val Leu Gly Leu Pro His Arg Pro Arg Pro Leu His Gly Leu Gln Gln
-285 -280 -275

CCG CAG CCC CCG GCG CTC CGG CAG CAG GAG GAG CAG CAG CAG CAG CAG 510
Pro Gln Pro Pro Ala Leu Arg Gln Gln Glu Glu Gln Gln Gln Gln Gln
-270 -265 -260

CAG CTG CCT CGC GGA GAG CCC CCT CCC GGG CGA CTG AAG TCC GCG CCC 558
Gln Leu Pro Arg Gly Glu Pro Pro Pro Gly Arg Leu Lys Ser Ala Pro
-255 -250 -245



CTC TTC ATG CTG GAT CTG TAC AAC GCC CTG TCC GCC GAC AAC GAC GAG 606
Leu Phe Met Leu Asp Leu Tyr Asn Ala Leu Ser Ala Asp Asn Asp Glu
-240 -235 -230

GAC GGG GCG TCG GAG GGG GAG AGG CAG CAG TCC TGG CCC CAC GAA GCA 654
Asp Gly Ala Ser Glu Gly Glu Arg Gln Gln Ser Trp Pro His Glu Ala
-225 -220 -215 -210

GCC AGC TCG TCC CAG CGT CGG CAG CCG CCC CCG GGC GCC GCG CAC CCG 702
Ala Ser Ser Ser Gln Arg Arg Gln Pro Pro Pro Gly Ala Ala His Pro
-205 -200 -195

CTC AAC CGC AAG AGC CTT CTG GCC CCC GGA TCT GGC AGC GGC GGC GCG 750
Leu Asn Arg Lys Ser Leu Leu Ala Pro Gly Ser Gly Ser Gly Gly Ala
-190 -185 -180

TCC CCA CTG ACC AGC GCG CAG GAC AGC GCC TTC CTC AAC GAC GCG GAC 798
Ser Pro Leu Thr Ser Ala Gln Asp Ser Ala Phe Leu Asn Asp Ala Asp
-175 -170 -165

ATG GTC ATG AGC TTT GTG AAC CTG GTG GAG TAC GAC AAG GAG TTC TCC 846
Met Val Met Ser Phe Val Asn Leu Val Glu Tyr Asp Lys Glu Phe Ser
-160 -155 -150

CCT CGT CAG CGA CAC CAC AAA GAG TTC AAG TTC AAC TTA TCC CAG ATT 894
Pro Arg Gln Arg His His Lys Glu Phe Lys Phe Asn Leu Ser Gln Ile
-145 -140 -135 -130

CCT GAG GGT GAG GTG GTG ACG GCT GCA GAA TTC CGC ATC TAC AAG GAC 942
Pro Glu Gly Glu Val Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Asp
-125 -120 -115

TGT GTT ATG GGG AGT TTT AAA AAC CAA ACT TTT CTT ATC AGC ATT TAT 990
Cys Val Met Gly Ser Phe Lys Asn Gln Thr Phe Leu Ile Ser Ile Tyr
-110 -105 -100

CAA GTC TTA CAG GAG CAT CAG CAC AGA GAC TCT GAC CTG TTT TTG TTG 1038
Gln Val Leu Gln Glu His Gln His Arg Asp Ser Asp Leu Phe Leu Leu
-95 -90 -85

GAC ACC CGT GTA GTA TGG GCC TCA GAA GAA GGC TGG CTG GAA TTT GAC 1086
Asp Thr Arg Val Val Trp Ala Ser Glu Glu Gly Trp Leu Glu Phe Asp
-80 -75 -70

ATC ACG GCC ACT AGC AAT CTG TGG GTT GTG ACT CCA CAG CAT AAC ATG 1134
Ile Thr Ala Thr Ser Asn Leu Trp Val Val Thr Pro Gln His Asn Met
-65 -60 -55 -50

GGG CTT CAG CTG AGC GTG GTG ACA AGG GAT GGA GTC CAC GTC CAC CCC 1182
Gly Leu Gln Leu Ser Val Val Thr Arg Asp Gly Val His Val His Pro
-45 -40 -35

CGA GCC GCA GGC CTG GTG GGC AGA GAC GGC CCT TAC GAT AAG CAG CCC 1230
Arg Ala Ala Gly Leu Val Gly Arg Asp Gly Pro Tyr Asp Lys Gln Pro
-30 -25 -20

TTC ATG GTG GCT TTC TTC AAA GTG AGT GAG GTC CAC GTG CGC ACC ACC 1278
Phe Met Val Ala Phe Phe Lys Val Ser Glu Val His Val Arg Thr Thr
-15 -10 -5

AGG TCA GCC TCC AGC CGG CGC CGA CAA CAG AGT CGT AAT CGC TCT ACC 1326
Arg Ser Ala Ser Ser Arg Arg Arg Gln Gln Ser Arg Asn Arg Ser Thr
1 5 10 15

CAG TCC CAG GAC GTG GCG CGG GTC TCC AGT GCT TCA GAT TAC AAC AGC 1374
Gln Ser Gln Asp Val Ala Arg Val Ser Ser Ala Ser Asp Tyr Asn Ser
20 25 30

AGT GAA TTG AAA ACA GCC TGC AGG AAG CAT GAG CTG TAT GTG AGT TTC 1422
Ser Glu Leu Lys Thr Ala Cys Arg Lys His Glu Leu Tyr Val Ser Phe
35 40 45

CAA GAC CTG GGA TGG CAG GAC TGG ATC ATT GCA CCC AAG GGC TAT GCT 1470
Gln Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala Pro Lys Gly Tyr Ala
50 55 60

GCC AAT TAC TGT GAT GGA GAA TGC TCC TTC CCA CTC AAC GCA CAC ATG 1518
Ala Asn Tyr Cys Asp Gly Glu Cys Ser Phe Pro Leu Asn Ala His Met
65 70 75

AAT GCA ACC AAC CAC GCG ATT GTG CAG ACC TTG GTT CAC CTT ATG AAC 1566
Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu Val His Leu Met Asn
80 85 90 95

CCC GAG TAT GTC CCC AAA CCG TGC TGT GCG CCA ACT AAG CTA AAT GCC 1614
Pro Glu Tyr Val Pro Lys Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala
100 105 110

ATC TCG GTT CTT TAC TTT GAT GAC AAC TCC AAT GTC ATT CTG AAA AAA 1662
Ile Ser Val Leu Tyr Phe Asp Asp Asn Ser Asn Val Ile Leu Lys Lys
115 120 125



TAC AGG AAT ATG GTT GTA AGA GCT TGT GGA TGC CAC TAACTCGAAA 1708
Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His
130 135 140

CCAGATGCTG GGGACACACA TTCTGCCTTG GATTCCTAGA TTACATCTGC CTTAAAAAAA 1768
CACGGAAGCA CAGTTGGAGG TGGGACGATG AGACTTTGAA ACTATCTCAT GCCAGTGCCT 1828
TATTACCCAG GAAGATTTTA AAGGACCTCA TTAATAATTT GCTCACTTGG TAAATGACGT 1888
GAGTAGTTGT TGGTCTGTAG CAAGCTGAGT TTGGATGTCT GTAGCATAAG GTCTGGTAAC 1948
TGCAGAAACA TAACCGTGAA GCTCTTCCTA CCCTCCTCCC CCAAAAACCC ACCAAAATTA 2008
GTTTTAGCTG TAGATCAAGC TATTTGGGGT GTTTGTTAGT AAATAGGGAA AATAATCTCA 2068
AAGGAGTTAA ATGTATTCTT GGCTAAAGGA TCAGCTGGTT CAGTACTGTC TATCAAAGGT 2128
AGATTTTACA GAGAACAGAA ATCGGGGAAG TGGGGGGAAC GCCTCTGTTC AGTTCATTCC 2188
CAGAAGTCCA CAGGACGCAC AGCCCAGGCC ACAGCCAGGG CTCCACGGGG CGCCCTTGTC 2248
TCAGTCATTG CTGTTGTATG TTCGTGCTGG AGTTTTGTTG GTGTGAAAAT ACACTTATTT 2308
CAGCCAAAAC ATACCATTTC TACACCTCAA TCCTCCATTT GCTGTACTCT TTGCTAGTAC 2368
CAAAAGTAGA CTGATTACAC TGAGGTGAGG CTACAAGGGG TGTGTAACCG TGTAACACGT 2428
GAAGGCAGTG CTCACCTCTT CTTTACCAGA ACGGTTCTTT GACCAGCACA TTAACTTCTG 2488
GACTGCCGGC TCTAGTACCT TTTCAGTAAA GTGGTTCTCT GCCTTTTTAC TATACAGCAT 2548
ACCACGCCAC AGGGTTAGAA CCAACGAAGA AAATAAAATG AGGGTGCCCA GCTTATAAGA 2608
ATGGTGTTAG GGGGATGAGC ATGCTGTTTA TGAACGGAAA TCATGATTTC CCTGTAGAAA 2668
GTGAGGCTCA GATTAAATTT TAGAATATTT TCTAAATGTC TTTTTCACAA TCATGTGACT 2728
GGGAAGGCAA TTTCATACTA AACTGATTAA ATAATACATT TATAATCTAC AACTGTTTGC 2788
ACTTACAGCT TTTTTTGTAA ATATAAACTA TAATTTATTG TCTATTTTAT ATCTGTTTTG 2848
CTGTGGCGTT GGGGGGGGGG CCGGGCTTTT GGGGGGGGGG GTTTGTTTGG GGGGTGTCGT 2908
GGTGTGGGCG GGCGG 2923

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 513 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

Met Pro Gly Leu Gly Arg Arg Ala Gln Trp Leu Cys Trp Trp Trp Gly
-374 -370 -365 -360

Leu Leu Cys Ser Cys Cys Gly Pro Pro Pro Leu Arg Pro Pro Leu Pro
-355 -350 -345

Ala Ala Ala Ala Ala Ala Ala Gly Gly Gln Leu Leu Gly Asp Gly Gly
-340 -335 -330

Ser Pro Gly Arg Thr Glu Gln Pro Pro Pro Ser Pro Gln Ser Ser Ser
-325 -320 -315

Gly Phe Leu Tyr Arg Arg Leu Lys Thr Gln Glu Lys Arg Glu Met Gln
-310 -305 -300 -295

Lys Glu Ile Leu Ser Val Leu Gly Leu Pro His Arg Pro Arg Pro Leu
-290 -285 -280

His Gly Leu Gln Gln Pro Gln Pro Pro Ala Leu Arg Gln Gln Glu Glu
-275 -270 -265

Gln Gln Gln Gln Gln Gln Leu Pro Arg Gly Glu Pro Pro Pro Gly Arg
-260 -255 -250

Leu Lys Ser Ala Pro Leu Phe Met Leu Asp Leu Tyr Asn Ala Leu Ser
-245 -240 -235

Ala Asp Asn Asp Glu Asp Gly Ala Ser Glu Gly Glu Arg Gln Gln Ser
-230 -225 -220 -215

Trp Pro His Glu Ala Ala Ser Ser Ser Gln Arg Arg Gln Pro Pro Pro
-210 -205 -200

Gly Ala Ala His Pro Leu Asn Arg Lys Ser Leu Leu Ala Pro Gly Ser
-195 -190 -185

Gly Ser Gly Gly Ala Ser Pro Leu Thr Ser Ala Gln Asp Ser Ala Phe
-180 -175 -170

Leu Asn Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu Tyr
-165 -160 -155

Asp Lys Glu Phe Ser Pro Arg Gln Arg His His Lys Glu Phe Lys Phe
-150 -145 -140 -135

Asn Leu Ser Gln Ile Pro Glu Gly Glu Val Val Thr Ala Ala Glu Phe
-130 -125 -120

Arg Ile Tyr Lys Asp Cys Val Met Gly Ser Phe Lys Asn Gln Thr Phe
-115 -110 -105

Leu Ile Ser Ile Tyr Gln Val Leu Gln Glu His Gln His Arg Asp Ser
-100 -95 -90

Asp Leu Phe Leu Leu Asp Thr Arg Val Val Trp Ala Ser Glu Glu Gly
-85 -80 -75

Trp Leu Glu Phe Asp Ile Thr Ala Thr Ser Asn Leu Trp Val Val Thr
-70 -65 -60 -55

Pro Gln His Asn Met Gly Leu Gln Leu Ser Val Val Thr Arg Asp Gly
-50 -45 -40

Val His Val His Pro Arg Ala Ala Gly Leu Val Gly Arg Asp Gly Pro
-35 -30 -25

Tyr Asp Lys Gln Pro Phe Met Val Ala Phe Phe Lys Val Ser Glu Val
-20 -15 -10

His Val Arg Thr Thr Arg Ser Ala Ser Ser Arg Arg Arg Gln Gln Ser
-5 1 5 10

Arg Asn Arg Ser Thr Gln Ser Gln Asp Val Ala Arg Val Ser Ser Ala
15 20 25

Ser Asp Tyr Asn Ser Ser Glu Leu Lys Thr Ala Cys Arg Lys His Glu
30 35 40

Leu Tyr Val Ser Phe Gln Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala
45 50 55

Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly Glu Cys Ser Phe Pro
60 65 70

Leu Asn Ala His Met Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu
75 80 85 90

Val His Leu Met Asn Pro Glu Tyr Val Pro Lys Pro Cys Cys Ala Pro
95 100 105

Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp Asp Asn Ser Asn
110 115 120

Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys
125 130 135

His
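
The negative residue numbers used in SEQ ID NOS: 7 and 8 follow from the CDS and mat_peptide locations: residues of the precursor upstream of the mature peptide count backward from -1, and the first residue of the mature peptide is numbered 1, with no residue 0. The following Python sketch reproduces that numbering from the stated locations (CDS beginning at 160, mat_peptide 1282..1698). The function name residue_number is a hypothetical helper introduced here for illustration.

```python
def residue_number(codon_start, mat_start):
    """Residue number of the codon beginning at nucleotide `codon_start`.

    `mat_start` is the first nucleotide of the mature peptide. Codons
    upstream of it receive negative numbers; there is no residue 0.
    """
    if (codon_start - mat_start) % 3 != 0:
        raise ValueError("codon_start is not in frame with mat_start")
    offset = (codon_start - mat_start) // 3
    return offset + 1 if offset >= 0 else offset


if __name__ == "__main__":
    cds_start, mat_start, mat_end = 160, 1282, 1698  # from SEQ ID NO:7

    first = residue_number(cds_start, mat_start)      # initiating Met
    last = residue_number(mat_end - 2, mat_start)     # final mature codon
    total = -first + last                             # precursor + mature

    print(first, last, total)
```

With the SEQ ID NO:7 values the initiating Met is residue -374 and the final His is residue 139, for a total of 513 residues, in agreement with the LENGTH stated for SEQ ID NO:8.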



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(H) CELL LINE: U2-OS osteosarcoma 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: U2-OS human osteosarcoma cDNA library

(B) CLONE: U2-16 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 699..2063

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1647..2060

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 1..2153 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CTGGTATATT TGTGCCTGCT GGAGGTGGAA TTAACAGTAA GAAGGAGAAA GGGATTGAAT 

GGACTTACAG GAAGGATTTC AAGTAAATTC AGGGAAACAC ATTTACTTGA ATAGTACAAC 

CTAGAGTATT ATTTTACACT AAGACGACAC AAAAGATGTT AAAGTTATCA CCAAGCTGCC 

GGACAGATAT ATATTCCAAC ACCAAGGTGC AGATCAGCAT AGATCTGTGA TTCAGAAATC 

AGGATTTGTT TTGGAAAGAG CTCAAGGGTT GAGAAGAACT CAAAAGCAAG TGAAGATTAC 

TTTGGGAACT ACAGTTTATC AGAAGATCAA CTTTTGCTAA TTCAAATACC AAAGGCCTGA 

TTATCATAAA TTCATATAGG AATGCATAGG TCATCTGATC AAATAATATT AGCCGTCTTC 

TGCTACATCA ATGCAGCAAA AACTCTTAAC AACTGTGGAT AATTGGAAAT CTGAGTTTCA 

GCTTTCTTAG AAATAACTAC TCTTGACATA TTCCAAAATA TTTAAAATAG GACAGGAAAA 

TCGGTGAGGA TGTTGTGCTC AGAAATGTCA CTGTCATGAA AAATAGGTAA ATTTGTTTTT 

TCAGCTACTG GGAAACTGTA CCTCCTAGAA CCTTAGGTTT TTTTTTTTTT AAGAGGACAA 

GAAGGACTAA AAATATCAAC TTTTGCTTTT GGACAAAA ATG CAT CTG ACT GTA 

Met His Leu Thr Val 
-316 -315

TTT TTA CTT AAG GGT ATT GTG GGT TTC CTC TGG AGC TGC TGG GTT CTA 
Phe Leu Leu Lys Gly Ile Val Gly Phe Leu Trp Ser Cys Trp Val Leu
-310 -305 -300 

GTG GGT TAT GCA AAA GGA GGT TTG GGA GAC AAT CAT GTT CAC TCC AGT 
Val Gly Tyr Ala Lys Gly Gly Leu Gly Asp Asn His Val His Ser Ser 
-295 -290 -285 -280 

TTT ATT TAT AGA AGA CTA CGG AAC CAC GAA AGA CGG GAA ATA CAA AGG 
Phe Ile Tyr Arg Arg Leu Arg Asn His Glu Arg Arg Glu Ile Gln Arg
-275 -270 -265 

GAA ATT CTC TCT ATC TTG GGT TTG CCT CAC AGA CCC AGA CCA TTT TCA 
Glu Ile Leu Ser Ile Leu Gly Leu Pro His Arg Pro Arg Pro Phe Ser
-260 -255 -250 

CCT GGA AAA ATG ACC AAT CAA GCG TCC TCT GCA CCT CTC TTT ATG CTG 
Pro Gly Lys Met Thr Asn Gln Ala Ser Ser Ala Pro Leu Phe Met Leu
-245 -240 -235 



GAT CTC TAC AAT GCC GAA GAA AAT CCT GAA GAG TCG GAG TAC TCA GTA
Asp Leu Tyr Asn Ala Glu Glu Asn Pro Glu Glu Ser Glu Tyr Ser Val
-230 -225 -220 

AGG GCA TCC TTG GCA GAA GAG ACC AGA GGG GCA AGA AAG GGA TAC CCA 1049 
Arg Ala Ser Leu Ala Glu Glu Thr Arg Gly Ala Arg Lys Gly Tyr Pro 
-215 -210 -205 -200 

GCC TCT CCC AAT GGG TAT CCT CGT CGC ATA CAG TTA TCT CGG ACG ACT 1097 
Ala Ser Pro Asn Gly Tyr Pro Arg Arg Ile Gln Leu Ser Arg Thr Thr
-195 -190 -185 

CCT CTG ACC ACC CAG AGT CCT CCT CTA GCC AGC CTC CAT GAT ACC AAC 1145 
Pro Leu Thr Thr Gln Ser Pro Pro Leu Ala Ser Leu His Asp Thr Asn
-180 -175 -170 

TTT CTG AAT GAT GCT GAC ATG GTC ATG AGC TTT GTC AAC TTA GTT GAA 1193 
Phe Leu Asn Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu 
-165 -160 -155 

AGA GAC AAG GAT TTT TCT CAC CAG CGA AGG CAT TAC AAA GAA TTT CGA 1241 
Arg Asp Lys Asp Phe Ser His Gln Arg Arg His Tyr Lys Glu Phe Arg
-150 -145 -140 

TTT GAT CTT ACC CAA ATT CCT CAT GGA GAG GCA GTG ACA GCA GCT GAA 1289
Phe Asp Leu Thr Gln Ile Pro His Gly Glu Ala Val Thr Ala Ala Glu
-135 -130 -125 -120

TTC CGG ATA TAC AAG GAC CGG AGC AAC AAC CGA TTT GAA AAT GAA ACA 1337
Phe Arg Ile Tyr Lys Asp Arg Ser Asn Asn Arg Phe Glu Asn Glu Thr
-115 -110 -105

ATT AAG ATT AGC ATA TAT CAA ATC ATC AAG GAA TAC ACA AAT AGG GAT 1385
Ile Lys Ile Ser Ile Tyr Gln Ile Ile Lys Glu Tyr Thr Asn Arg Asp
-100 -95 -90

GCA GAT CTG TTC TTG TTA GAC ACA AGA AAG GCC CAA GCT TTA GAT GTG 1433
Ala Asp Leu Phe Leu Leu Asp Thr Arg Lys Ala Gln Ala Leu Asp Val
-85 -80 -75

GGT TGG CTT GTC TTT GAT ATC ACT GTG ACC AGC AAT CAT TGG GTG ATT 1481
Gly Trp Leu Val Phe Asp Ile Thr Val Thr Ser Asn His Trp Val Ile
-70 -65 -60

AAT CCC CAG AAT AAT TTG GGC TTA CAG CTC TGT GCA GAA ACA GGG GAT 1529
Asn Pro Gln Asn Asn Leu Gly Leu Gln Leu Cys Ala Glu Thr Gly Asp
-55 -50 -45 -40

GGA CGC AGT ATC AAC GTA AAA TCT GCT GGT CTT GTG GGA AGA CAG GGA 1577
Gly Arg Ser Ile Asn Val Lys Ser Ala Gly Leu Val Gly Arg Gln Gly
-35 -30 -25

CCT CAG TCA AAA CAA CCA TTC ATG GTG GCC TTC TTC AAG GCG AGT GAG 1625
Pro Gln Ser Lys Gln Pro Phe Met Val Ala Phe Phe Lys Ala Ser Glu
-20 -15 -10

GTA CTT CTT CGA TCC GTG AGA GCA GCC AAC AAA CGA AAA AAT CAA AAC 1673
Val Leu Leu Arg Ser Val Arg Ala Ala Asn Lys Arg Lys Asn Gln Asn
-5 1 5



CGC AAT AAA TCC AGC TCT CAT CAG GAC TCC TCC AGA ATG TCC AGT GTT
Arg Asn Lys Ser Ser Ser His Gln Asp Ser Ser Arg Met Ser Ser Val
10

GGA GAT TAT AAC ACA AGT GAG CAA AAA CAA GCC TGT AAG AAG CAC GAA
Gly Asp Tyr Asn Thr Ser Glu Gln Lys Gln Ala Cys Lys Lys His Glu
30 35

CTC TAT GTG AGC TTC CGG GAT CTG GGA TGG CAG GAC TGG ATT ATA GCA
Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala

CCA GAA GGA TAC GCT GCA TTT TAT TGT GAT GGA GAA TGT TCT TTT CCA
Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly Glu Cys Ser Phe Pro
60 65

CTT AAC GCC CAT ATG AAT GCC ACC AAC CAC GCT ATA GTT CAG ACT CTG
Leu Asn Ala His Met Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu
75 80 85

GTT CAT CTG ATG TTT CCT GAC CAC GTA CCA AAG CCT TGT TGT GCT CCA
Val His Leu Met Phe Pro Asp His Val Pro Lys Pro Cys Cys Ala Pro
90 95 100

ACC AAA TTA AAT GCC ATC TCT GTT CTG TAC TTT GAT GAC AGC TCC AAT
Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn
110 115

GTC ATT TTG AAA AAA TAT AGA AAT ATG GTA GTA CGC TCA TGT GGC TGC
Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ser Cys Gly Cys
125 130 135

CAC TAATATTAAA TAATATTGAT AATAACAAAA AGATCTGTAT TAAGGTTTAT 
His 

GGCTGCAATA AAAAGCATAC TTTCAGACAA ACAGAAAAAA AAA 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met His Leu Thr Val Phe Leu Leu Lys Gly Ile Val Gly Phe Leu Trp
-316 -315 -310 -305

Ser Cys Trp Val Leu Val Gly Tyr Ala Lys Gly Gly Leu Gly Asp Asn
-300 -295 -290 -285

His Val His Ser Ser Phe Ile Tyr Arg Arg Leu Arg Asn His Glu Arg
-280 -275

Arg Glu Ile Gln Arg Glu Ile Leu Ser Ile Leu Gly Leu Pro His Arg
-265

Pro Arg Pro Phe Ser Pro Gly Lys Met Thr Asn Gln Ala Ser Ser Ala
-250 -245 -240

Pro Leu Phe Met Leu Asp Leu Tyr Asn Ala Glu Glu Asn Pro Glu Glu
-235 -230 -225

Ser Glu Tyr Ser Val Arg Ala Ser Leu Ala Glu Glu Thr Arg Gly Ala
-220 -215 -210 -205

Arg Lys Gly Tyr Pro Ala Ser Pro Asn Gly Tyr Pro Arg Arg Ile Gln
-200 -195 -190

Leu Ser Arg Thr Thr Pro Leu Thr Thr Gln Ser Pro Pro Leu Ala Ser
-185 -180 -175

Leu His Asp Thr Asn Phe Leu Asn Asp Ala Asp Met Val Met Ser Phe
-170 -165 -160

Val Asn Leu Val Glu Arg Asp Lys Asp Phe Ser His Gln Arg Arg His
-155 -150 -145

Tyr Lys Glu Phe Arg Phe Asp Leu Thr Gln Ile Pro His Gly Glu Ala
-140 -135 -130 -125

Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Asp Arg Ser Asn Asn Arg
-120 -115 -110

Phe Glu Asn Glu Thr Ile Lys Ile Ser Ile Tyr Gln Ile Ile Lys Glu
-105 -100 -95

Tyr Thr Asn Arg Asp Ala Asp Leu Phe Leu Leu Asp Thr Arg Lys Ala
-90 -85 -80

Gln Ala Leu Asp Val Gly Trp Leu Val Phe Asp Ile Thr Val Thr Ser
-75 -70 -65

Asn His Trp Val Ile Asn Pro Gln Asn Asn Leu Gly Leu Gln Leu Cys
-60 -55 -50 -45

Ala Glu Thr Gly Asp Gly Arg Ser Ile Asn Val Lys Ser Ala Gly Leu
-40 -35 -30

Val Gly Arg Gln Gly Pro Gln Ser Lys Gln Pro Phe Met Val Ala Phe
-25 -20 -15

Phe Lys Ala Ser Glu Val Leu Leu Arg Ser Val Arg Ala Ala Asn Lys
-10 -5 1

Arg Lys Asn Gln Asn Arg Asn Lys Ser Ser Ser His Gln Asp Ser Ser
5 10 15 20

Arg Met Ser Ser Val Gly Asp Tyr Asn Thr Ser Glu Gln Lys Gln Ala
25 30 35

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gln
40 45 50

Asp Trp Ile Ile Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly
55 60 65

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala
70 75 80

Ile Val Gln Thr Leu Val His Leu Met Phe Pro Asp His Val Pro Lys
85 90 95

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe
105 110 115

Asp Asp Ser Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val
120 125 130 

Arg Ser Cys Gly Cys His 
135 
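
The feature coordinates given for SEQ ID NO: 9 can be cross-checked against the 454 amino acid length stated for SEQ ID NO: 10. The following is a minimal Python sketch of that arithmetic and is not part of the disclosure. The helper name `codons` is hypothetical, and the sketch assumes that the CDS range 699..2063 includes the three-base stop codon while the mat_peptide range 1647..2060 does not.

```python
def codons(start: int, end: int) -> int:
    """Count whole codons in an inclusive, 1-based nucleotide range."""
    length = end - start + 1
    if length % 3:
        raise ValueError(f"range {start}..{end} is not a whole number of codons")
    return length // 3


# Feature coordinates quoted from the SEQ ID NO: 9 header.
cds_start, cds_end = 699, 2063
mat_start, mat_end = 1647, 2060

cds_codons = codons(cds_start, cds_end)           # assumed to include the stop codon
precursor_residues = cds_codons - 1               # residues encoded before the stop
mature_residues = codons(mat_start, mat_end)      # residues numbered from 1 upward
pro_residues = codons(cds_start, mat_start - 1)   # residues carrying negative numbers

print(cds_codons, precursor_residues, mature_residues, pro_residues)
```

Under those assumptions the counts come to 455 codons in the CDS, 454 encoded residues, 138 mature residues and 316 residues ahead of the mature peptide, which agrees with a residue numbering that starts at -316.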

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: Human Heart 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Human heart cDNA library, Stratagene catalog #936208

(B) CLONE: hH38 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 8..850

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 427..843

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 1..997 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GAATTCC GAG CCC CAT TGG AAG GAG TTC CGC TTT GAC CTG ACC CAG ATC 
Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gln Ile
-139 -135 -130

CCG GCT GGG GAG GCG GTC ACA GCT GCG GAG TTC CGG ATT TAC AAG GTG
Pro Ala Gly Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Val
-125 -120 -115 -110 

CCC AGC ATC CAC CTG CTC AAC AGG ACC CTC CAC GTC AGC ATG TTC CAG 
Pro Ser Ile His Leu Leu Asn Arg Thr Leu His Val Ser Met Phe Gln
-105 -100 -95 

GTG GTC CAG GAG CAG TCC AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT 
Val Val Gln Glu Gln Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp
-90 -85 -80 

CTT CAG ACG CTC CGA GCT GGA GAC GAG GGC TGG CTG GTG CTG GAT GTC 
Leu Gln Thr Leu Arg Ala Gly Asp Glu Gly Trp Leu Val Leu Asp Val
-75 -70 -65 

ACA GCA GCC AGT GAC TGC TGG TTG CTG AAG CGT CAC AAG GAC CTG GGA 
Thr Ala Ala Ser Asp Cys Trp Leu Leu Lys Arg His Lys Asp Leu Gly 
-60 -55 -50 

CTC CGC CTC TAT GTG GAG ACT GAG GAT GGG CAC AGC GTG GAT CCT GGC 
Leu Arg Leu Tyr Val Glu Thr Glu Asp Gly His Ser Val Asp Pro Gly 
-45 -40 -35 -30 

CTG GCC GGC CTG CTG GGT CAA CGG GCC CCA CGC TCC CAA CAG CCT TTC 
Leu Ala Gly Leu Leu Gly Gln Arg Ala Pro Arg Ser Gln Gln Pro Phe
-25 -20 -15 

GTG GTC ACT TTC TTC AGG GCC AGT CCG AGT CCC ATC CGC ACC CCT CGG 
Val Val Thr Phe Phe Arg Ala Ser Pro Ser Pro Ile Arg Thr Pro Arg
-10 -5 1 

GCA GTG AGG CCA CTG AGG AGG AGG CAG CCG AAG AAA AGC AAC GAG CTG 
Ala Val Arg Pro Leu Arg Arg Arg Gln Pro Lys Lys Ser Asn Glu Leu
5 10 15 

CCG CAG GCC AAC CGA CTC CCA GGG ATC TTT GAT GAC GTC CAC GGC TCC 
Pro Gln Ala Asn Arg Leu Pro Gly Ile Phe Asp Asp Val His Gly Ser
20 25 30 35 

CAC GGC CGG CAG GTC TGC CGT CGG CAC GAG CTC TAC GTC AGC TTC CAG 
His Gly Arg Gln Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gln
40 45 50 

GAC CTT GGC TGG CTG GAC TGG GTC ATC GCC CCC CAA GGC TAC TCA GCC 
Asp Leu Gly Trp Leu Asp Trp Val Ile Ala Pro Gln Gly Tyr Ser Ala
55 60 65 

TAT TAC TGT GAG GGG GAG TGC TCC TTC CCG CTG GAC TCC TGC ATG AAC 
Tyr Tyr Cys Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn 
70 75 80 

GCC ACC AAC CAC GCC ATC CTG CAG TCC CTG GTG CAC CTG ATG AAG CCA 
Ala Thr Asn His Ala Ile Leu Gln Ser Leu Val His Leu Met Lys Pro
85 90 95 

AAC GCA GTC CCC AAG GCG TGC TGT GCA CCC ACC AAG CTG AGC GCC ACC 
Asn Ala Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr 
100 105 110 115

TCT GTG CTC TAC TAT GAC AGC AGC AAC AAC GTC ATC CTG CGC AAG CAC
Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val Ile Leu Arg Lys His
120 125 

CGC AAC ATG GTG GTC AAG GCC TGC GGC TGC CAC TGAGTCAGCC CGCCCAGCCC 
Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
135 140

TACTGCAGCC ACCCTTCTCA TCTGGATCGG GCCCTGCAGA GGCAGAAAAC CCTTAAATGC 
TGTCACAGCT CAAGCAGGAG TGTCAGGGGC CCTCACTCTC GGTGCCTACT TCCTGTCAGG 
CTTCTGGGAA TTC 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gln Ile Pro Ala
-139 -135 -130 -125

Gly Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Val Pro Ser
-120 -115 -110

Ile His Leu Leu Asn Arg Thr Leu His Val Ser Met Phe Gln Val Val
-105 -100 -95

Gln Glu Gln Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gln
-90 -85 -80

Thr Leu Arg Ala Gly Asp Glu Gly Trp Leu Val Leu Asp Val Thr Ala
-75 -70 -65 -60

Ala Ser Asp Cys Trp Leu Leu Lys Arg His Lys Asp Leu Gly Leu Arg
-55 -50 -45

Leu Tyr Val Glu Thr Glu Asp Gly His Ser Val Asp Pro Gly Leu Ala
-40 -35 -30

Gly Leu Leu Gly Gln Arg Ala Pro Arg Ser Gln Gln Pro Phe Val Val
-25 -20 -15

Thr Phe Phe Arg Ala Ser Pro Ser Pro Ile Arg Thr Pro Arg Ala Val
-10 -5 1 5

Arg Pro Leu Arg Arg Arg Gln Pro Lys Lys Ser Asn Glu Leu Pro Gln
10 15 20

Ala Asn Arg Leu Pro Gly Ile Phe Asp Asp Val His Gly Ser His Gly
25 30 35

Arg Gln Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gln Asp Leu
40 45 50

Gly Trp Leu Asp Trp Val Ile Ala Pro Gln Gly Tyr Ser Ala Tyr Tyr
55 60 65

Cys Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn Ala Thr
70 75 80 85

Asn His Ala Ile Leu Gln Ser Leu Val His Leu Met Lys Pro Asn Ala
90 95 100

Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val
105 110 115

Leu Tyr Tyr Asp Ser Ser Asn Asn Val Ile Leu Arg Lys His Arg Asn
120 125 130 

Met Val Val Lys Ala Cys Gly Cys His 
135 140 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE:

(B) CLONE: pALBP2-781

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 2724..3071

(ix) FEATURE:

(A) NAME/KEY: terminator

(B) LOCATION: 3150..3218

(ix) FEATURE:

(A) NAME/KEY: RBS

(B) LOCATION: 2222..2723

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GACGAAAGGG CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT 60 

CTTAGACGTC AGGTGGCACT TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT 120

TCTAAATACA TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT 180

AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT 240

TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 300 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA 360

TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC 420
TATGTGGCGC GGTATTATCC CGTATTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC 480
ACTATTCTCA GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG 540
GCATGACAGT AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 600
ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG 660
GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG 720
ACGAGCGTGA CACCACGATG CCTGTAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG 780
GCGAACTACT TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG 840
TTGCAGGACC ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 900
GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT GGTAAGCCCT 960
CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC 1020
AGATCGCTGA GATAGGTGCC TCACTGATTA AGCATTGGTA ACTGTCAGAC CAAGTTTACT 1080
CATATATACT TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA 1140
TCCTTTTTGA TAATCTCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT 1200
CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT 1260
GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC 1320
TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC 1380
TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC 1440
TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG 1500
GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT 1560
CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG 1620
AGCATTGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG 1680
GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT 1740
ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG 1800
GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT 1860
GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA 1920
TTACCGCCTT TGAGTGAGCT GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT 1980
CAGTGAGCGA GGAAGCGGAA GAGCGCCCAA TACGCAAACC GCCTCTCCCC GCGCGTTGGC 2040
CGATTCATTA ATGCAGAATT GATCTCTCAC CTACCAAACA ATGCCCCCCT GCAAAAAATA 2100
AATTCATATA AAAAACATAC AGATAACCAT CTGCGGTGAT AAATTATCTC TGGCGGTGTT 2160

GACATAAATA CCACTGGCGG TGATACTGAG CACATCAGCA GGACGCACTG ACCACCATGA 2220

AGGTGACGCT CTTAAAAATT AAGCCCTGAA GAAGGGCAGC ATTCAAAGCA GAAGGCTTTG 2280 

GGGTGTGTGA TACGAAACGA AGCATTGGCC GTAAGTGCGA TTCCGGATTA GCTGCCAATG 2340

TGCCAATCGC GGGGGGTTTT CGTTCAGGAC TACAACTGCC ACACACCACC AAAGCTAACT 2400

GACAGGAGAA TCCAGATGGA TGCACAAACA CGCCGCCGCG AACGTCGCGC AGAGAAACAG 2460

GCTCAATGGA AAGCAGCAAA TCCCCTGTTG GTTGGGGTAA GCGCAAAACC AGTTCCGAAA 2520 

GATTTTTTTA ACTATAAACG CTGATGGAAG CGTTTATGCG GAAGAGGTAA AGCCCTTCCC 2580 

GAGTAACAAA AAAACAACAG CATAAATAAC CCCGCTCTTA CACATTCCAG CCCTGAAAAA 2640

GGGCATCAAA TTAAACCACA CCTATGGTGT ATGCATTTAT TTGCATACAT TCAATCAATT 2700

GTTATCTAAG GAAATACTTA CAT ATG CAA GCT AAA CAT AAA CAA CGT AAA 2750
Met Gln Ala Lys His Lys Gln Arg Lys
1 5

CGT CTG AAA TCT AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC AGT 2798
Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser
10 15 20 25

GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC GCC 2846
Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr His Ala
30 35 40

TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG AAC 2894
Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn
45 50 55

TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC TCT 2942
Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser
60 65 70

AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC TCG 2990
Lys Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser
75 80 85

ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT CAG 3038
Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gln
90 95 100 105

GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAGTACAGCA AAATTAAATA 3088
Asp Met Val Val Glu Gly Cys Gly Cys Arg
110 115



CATAAATATA TATATATATA TATATTTTAG AAAAAAGAAA AAAATCTAGA GTCGACCTGC 3148

AGTAATCGTA CAGGGTAGTA CAAATAAAAA AGGCACGTCA GATGACGTGC CTTTTTTCTT 3208

GTGAGCAGTA AGCTTGGCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG 3268

CGTTACCCAA CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA 3328

AGAGGCCCGC ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGCGCCT 3388

GATGCGGTAT TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAT ATGGTGCACT 3448

CTCAGTACAA TCTGCTCTGA TGCCGCATAG TTAAGCCAGC CCCGACACCC GCCAACACCC 3508 

GCTGACGCGC CCTGACGGGC TTGTCTGCTC CCGGCATCCG CTTACAGACA AGCTGTGACC 3568 

GTCTCCGGGA GCTGCATGTG TCAGAGGTTT TCACCGTCAT CACCGAAACG CGCGA 3623 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Gln Ala Lys His Lys Gln Arg Lys Arg Leu Lys Ser Ser Cys Lys
1 5 10 15

Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp
20 25 30

Ile Val Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly Glu Cys
35 40 45

Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala Ile Val
50 55 60

Gln Thr Leu Val Asn Ser Val Asn Ser Lys Ile Pro Lys Ala Cys Cys
65 70 75 80

Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu Asp Glu Asn
85 90 95

Glu Lys Val Val Leu Lys Asn Tyr Gln Asp Met Val Val Glu Gly Cys
100 105 110 

Gly Cys Arg 
115 
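
The residues listed for SEQ ID NO: 14 can be cross-checked against the coding region of SEQ ID NO: 13 (CDS at 2724..3071) by translating its codons. The following is a minimal Python sketch using the standard genetic code and is not part of the disclosure. The names `CODON_TABLE`, `THREE_LETTER` and `translate` are hypothetical, and only the first and last codon runs of the coding region are shown.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter amino acid codes.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace ignored) until the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino = CODON_TABLE[dna[i:i + 3]]
        if amino == "*":  # stop codon ends the reading frame
            break
        residues.append(THREE_LETTER[amino])
    return " ".join(residues)


# First and last codon runs of the CDS as printed in SEQ ID NO: 13.
first_codons = "ATG CAA GCT AAA CAT AAA CAA CGT AAA"
last_codons = "GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAG"

print(translate(first_codons))
print(translate(last_codons))
```

With the standard code, the first run translates to Met Gln Ala Lys His Lys Gln Arg Lys and the last run to Asp Met Val Val Glu Gly Cys Gly Cys Arg, the TAG codon being read as the stop.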

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
CATGGGCAGC TGAG

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
GAGGGTTGTG GGTGTCGCTA GTGAGTCGAC TACAGCAAAT T 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGATGTGGGT GCCGCTGACT CTAGAGTCGA CGGAATTC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AATTCACCAT GATTCCTGGT AACCGAATGC T 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
GTGGTACTAA GGACCATTGG CTTAC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CGACCTGCAG CCATGCATCT GACTGTA 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TGCCTGCAGT TTAATATTAG TGGCAGC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGACCTGCAG CCACC 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TCGACCCACC ATGCCGGGGC TGGGGCGGAG GGCGCAGTGG CTGTGCTGGT GGTGGGGGCT 
GTGCTGCAGC TGCTGCGGGC C 
(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
CGCAGCAGCT GCACAGCAGC CCCCACCACC AGCACAGCCA CTGCGCCCTC CGCCCCAGCC 
CCGGCATGGT GGG

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCGACTGGTT T 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
CGAAACCAG

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
TCGACAGGCT CGCCTGCA 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTCCGAGCGG 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CAGGTCGACC CACCATGCAC GTGCGCTCA 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
TCTGTCGACC TCGGAGGAGC TAGTGGC

WHAT IS CLAIMED IS:

1. A method for producing a heterodimeric
protein having bone stimulating activity comprising
culturing a selected host cell containing a sequence
encoding a first selected BMP or fragment thereof and a
sequence encoding a second selected BMP or fragment
thereof, said sequences each being under the control of a
suitable regulatory sequence capable of directing
co-expression of said proteins, and isolating said
heterodimeric protein from the culture medium.

2. The method according to claim 1 wherein
said first BMP or fragment thereof is present on a first
vector transfected into said host cell and said second
BMP or fragment thereof is present on a second vector
transfected into said host cell.

3. The method according to claim 1 wherein
both said BMPs or fragments thereof are incorporated into
a chromosome of said host cell.

4. The method according to claim 1 wherein
both BMPs or fragments thereof are present on a single
vector.

5. The method according to claim 2 wherein
more than a single copy of the gene encoding each said
BMP or fragment thereof is present on each vector.

6. The method according to claim 1 wherein
said host cell is a hybrid cell prepared by culturing two
fused selected, stable host cells, each host cell
transfected with a sequence encoding a selected first or
second BMP or fragment thereof, said sequences under the
control of a suitable regulatory sequence capable of
directing expression of each protein or fragment.

7. The method according to claim 1 wherein
said host cell is a mammalian cell.

8. The method according to claim 1 wherein
said host cell is an insect cell.

9. The method according to claim 1 wherein
said host cell is a yeast cell.

10. A method for producing a heterodimeric
protein having bone stimulating activity in a bacterial
cell comprising culturing a selected host cell containing
a sequence encoding a first selected BMP or fragment
thereof under the control of a suitable regulatory
sequence capable of directing expression of the protein
or protein fragment under conditions suitable for the
formation of a soluble, monomeric protein; culturing a
selected host cell containing a sequence encoding a
second selected BMP or fragment thereof under the control
of a suitable regulatory sequence capable of directing
expression of the protein or protein fragment under said
conditions to form a second soluble, monomeric protein;
and mixing said soluble monomeric proteins under
conditions permitting the formation of dimeric proteins
associated by at least one covalent disulfide bond;
isolating from the mixture a heterodimeric protein.

11. The method according to claim 10 wherein
said host cell is E. coli.

12. The method according to claim 10 wherein
said conditions comprise treating said protein with a
solubilizing agent.

13. A recombinant heterodimeric protein having
bone stimulating activity comprising a first protein or
fragment of BMP-2 in association with a second protein or
fragment thereof selected from the group consisting of
BMP-5, BMP-6, BMP-7 and BMP-8.

14. The protein according to claim 13 wherein
said second protein is BMP-5.

15. The protein according to claim 13 wherein
said second protein is BMP-6.

16. The protein according to claim 13 wherein
said second protein is BMP-7.

17. The protein according to claim 13 wherein
said second protein is BMP-8.

18. A recombinant heterodimeric protein having
bone stimulating activity comprising a protein or
fragment of BMP-4 in association with a second protein or
fragment thereof selected from the group consisting of
BMP-5, BMP-6, BMP-7 and BMP-8.

19. The protein according to claim 18 wherein
said second protein is BMP-5.

20. The protein according to claim 18 wherein
said second protein is BMP-6.

21. The protein according to claim 18 wherein
said second protein is BMP-7.

22. The protein according to claim 18 wherein
said second protein is BMP-8.

23. A recombinant heterodimeric protein having
bone stimulating activity comprising a protein or
fragment of a first BMP in association with a second
protein or fragment of a second BMP produced by
co-expressing said proteins in a selected host cell.

24. The protein according to claim 23 wherein
said first BMP is BMP-2 and said second BMP is BMP-7.

25. A cell line comprising a nucleotide
sequence encoding a first BMP or fragment thereof under
control of a suitable expression regulatory system and a
nucleotide sequence encoding a second BMP or fragment
thereof under control of a suitable expression regulatory
system, said regulatory systems capable of directing the
co-expression of said BMPs or fragments thereof and the
formation of heterodimeric protein.

26. The cell line according to claim 25
wherein said nucleotide sequences encoding said first and
second BMP proteins are present in a single DNA molecule.

27. The cell line according to claim 25
wherein said nucleotide sequence encoding said first BMP
is present on a first DNA molecule and said nucleotide
sequence encoding said second BMP is present on a second
DNA molecule.

28. The cell line according to claim 26
wherein said single DNA molecule comprises a first
transcription unit containing a gene encoding a first BMP
or fragment thereof and a second transcription unit
containing a gene encoding a second BMP or fragment
thereof.

29. The cell line according to claim 26
wherein said single DNA molecule comprises a single
transcription unit containing multiple copies of said
gene encoding said first BMP or fragments thereof and
multiple copies of said gene encoding said second BMP or
fragments thereof.

30. A DNA molecule comprising a sequence
encoding a first selected BMP or fragment thereof and a
sequence encoding a second selected BMP or fragment
thereof, said sequences under the control of at least one
suitable regulatory sequence capable of directing
co-expression of each BMP or fragment thereof.

31. The molecule according to claim 30
comprising a first transcription unit containing a gene
encoding a first BMP or fragment thereof and a second
transcription unit containing a gene encoding a second
BMP or fragment thereof.

32. The molecule according to claim 30
comprising a single transcription unit containing
multiple copies of said gene encoding said first BMP or
fragments thereof and multiple copies of said gene
encoding said second BMP or fragments thereof.

33. The protein according to claim 23 wherein
said first BMP is BMP-2 and said second BMP is BMP-6.

34. A recombinant BMP-2 homodimer having bone
stimulating activity, said homodimer produced in E. coli.

35. A method for producing a homodimeric BMP-2
protein having bone stimulating activity, said method
comprising culturing E. coli host cells and isolating and
purifying said protein from the resulting culture medium.

36. A recombinant heterodimeric protein having
bone stimulating activity comprising a first protein or
fragment of BMP-2 in association with a second protein or
fragment of BMP-2.

FIGURE 1A



10 20 30 40 50 60 70 

GTCGACTCTA GAGTGTGTGT CAGCACTTGG CTGGGGACTT CTTGAACTTG CAGGGAGAAT AACTTGCGCA 



80 90 100 110 120 130 140 

CCCCACTTTG CGCCGGTGCC TTTGCCCCAG CGGAGCCTGC TTCGCCATCT CCGAGCCCCA CCGCCCCTCC 



150 160 170 180 190 200 210 

ACTCCTCGGC CTTGCCCGAC ACTGAGACGC TGTTCCCAGC GTGAAAAGAG AGACTGCGCG GCCGGCACCC 



220 230 240 250 260 270 280

GGGAGAAGGA GGAGGCAAAG AAAAGGAACG GACATTCGGT CCTTGCGCCA GGTCCTTTGA CCAGAGTTTT 



290 300 310 320 330 340 350 

TCCATGTGGA CGCTCTTTCA ATGGACGTGT CCCCGCGTGC TTCTTAGACG GACTGCGGTC TCCTAAAGGT 



(1) 370 385 400

CGACC ATG GTG GCC GGG ACC CGC TGT CTT CTA GCG TTG CTG CTT CCC CAG GTC
MET Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gln Val

415 430 445

CTC CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG TTC GCG
Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys Phe Ala

(24)

460 475 490 505

GCG GCG TCG TCG GGC CGC CCC TCA TCC CAG CCC TCT GAC GAG GTC CTG AGC GAG
Ala Ala Ser Ser Gly Arg Pro Ser Ser Gln Pro Ser Asp Glu Val Leu Ser Glu

520 535 550 565

TTC GAG TTG CGG CTG CTC AGC ATG TTC GGC CTG AAA CAG AGA CCC ACC CCC AGC
Phe Glu Leu Arg Leu Leu Ser MET Phe Gly Leu Lys Gln Arg Pro Thr Pro Ser

580 595 610

AGG GAC GCC GTG GTG CCC CCC TAC ATG CTA GAC CTG TAT CGC AGG CAC TCA GGT
Arg Asp Ala Val Val Pro Pro Tyr MET Leu Asp Leu Tyr Arg Arg His Ser Gly

625 640 655 670

CAG CCG GGC TCA CCC GCC CCA GAC CAC CGG TTG GAG AGG GCA GCC AGC CGA GCC
Gln Pro Gly Ser Pro Ala Pro Asp His Arg Leu Glu Arg Ala Ala Ser Arg Ala



SUBSTITUTE SHEET 



WO 93/09229 



2/32 



PCT/US92/09430 



FIGURE 1B



685 700 715

AAC ACT GTG CGC AGC TTC CAC CAT GAA GAA TCT TTG GAA GAA CTA CCA GAA ACG
Asn Thr Val Arg Ser Phe His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr

730 745 760 775

AGT GGG AAA ACA ACC CGG AGA TTC TTC TTT AAT TTA AGT TCT ATC CCC ACG GAG
Ser Gly Lys Thr Thr Arg Arg Phe Phe Phe Asn Leu Ser Ser Ile Pro Thr Glu

790 805 820 835

GAG TTT ATC ACC TCA GCA GAG CTT CAG GTT TTC CGA GAA CAG ATG CAA GAT GCT
Glu Phe Ile Thr Ser Ala Glu Leu Gln Val Phe Arg Glu Gln MET Gln Asp Ala

850 865 880

TTA GGA AAC AAT AGC AGT TTC CAT CAC CGA ATT AAT ATT TAT GAA ATC ATA AAA
Leu Gly Asn Asn Ser Ser Phe His His Arg Ile Asn Ile Tyr Glu Ile Ile Lys

895 910 925 940

CCT GCA ACA GCC AAC TCG AAA TTC CCC GTG ACC AGA CTT TTG GAC ACC AGG TTG
Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu Asp Thr Arg Leu

955 970 985

GTG AAT CAG AAT GCA AGC AGG TGG GAA AGT TTT GAT GTC ACC CCC GCT GTG ATG
Val Asn Gln Asn Ala Ser Arg Trp Glu Ser Phe Asp Val Thr Pro Ala Val MET

1000 1015 1030 1045

CGG TGG ACT GCA CAG GGA CAC GCC AAC CAT GGA TTC GTG GTG GAA GTG GCC CAC
Arg Trp Thr Ala Gln Gly His Ala Asn His Gly Phe Val Val Glu Val Ala His

1060 1075 1090 1105

TTG GAG GAG AAA CAA GGT GTC TCC AAG AGA CAT GTT AGG ATA AGC AGG TCT TTG
Leu Glu Glu Lys Gln Gly Val Ser Lys Arg His Val Arg Ile Ser Arg Ser Leu

(249)

1120 1135 1150

CAC CAA GAT GAA CAC AGC TGG TCA CAG ATA AGG CCA TTG CTA GTA ACT TTT GGC
His Gln Asp Glu His Ser Trp Ser Gln Ile Arg Pro Leu Leu Val Thr Phe Gly

(266)

1165 1180 1195 1210

CAT GAT GGA AAA GGG CAT CCT CTC CAC AAA AGA GAA AAA CGT CAA GCC AAA CAC
His Asp Gly Lys Gly His Pro Leu His Lys Arg Glu Lys Arg Gln Ala Lys His

(283)

1225 1240 1255

AAA CAG CGG AAA CGC CTT AAG TCC AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC
Lys Gln Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp

(296)

1270 1285 1300 1315

TTC AGT GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC GCC
Phe Ser Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr His Ala

FIGURE 1C



1330 1345 1360 1375

TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG AAC TCC ACT
Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr

1390 1405 1420

AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC TCT AAG ATT CCT AAG
Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser Lys Ile Pro Lys

1435 1450 1465 1480

GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC TCG ATG CTG TAC CTT GAC GAG
Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser MET Leu Tyr Leu Asp Glu

1495 1510 1525

AAT GAA AAG GTT GTA TTA AAG AAC TAT CAG GAC ATG GTT GTG GAG GGT TGT GGG
Asn Glu Lys Val Val Leu Lys Asn Tyr Gln Asp MET Val Val Glu Gly Cys Gly

1540(396) 1553 1563 1573 1583 1593 1603

TGT CGC TAGTACAGCA AAATTAAATA CATAAATATA TATATATATA TATATTTTAG AAAAAAGAAA
Cys Arg



AAAA

FIGURE 2A



CTCTAGAGGG CAGAGGAGGA GGGAGGGAGg" GAAGGAGCGC° GGAGCCCGG^CCGGAAGCTAGGTGAGTGTG 

80 90 100 110 120 130 140

GCATCCGAGC TGAGGGACGC GAGCCTGAGA CGCCGCTGCT GCTCCGGCTG AGTATCTAGC TTGTCTCCCC 

150 160 170 180 190 200 210

GATGGGATTC CCGTCCAAGC TATCTCGAGC CTGCAGCGCC ACAGTCCCCG GCCCTCGCCC AGGTTCACTG 

220 230 240 250 260 270 

CAACCGTTCA GAGGTCCCCA GGAGCTGCTG CTGGCGAGCC CGCTACTGCA GGGACCTATG GAGCCATTCC 

290 300 310 320 330 340

GTAGTGCCAT CCCGAGCAAC GCACTGCTGC AGCTTCCCTG AGCCTTTCCA CCAAGTTTGT TCAAGATTGG 

360 370 380 390 400 (1)

CTGTCAAGAA TCATGGACTG TTATTATATG CCTTGTTTTC TGTCAAGACA CC ATG ATT CCT 

MET Ile Pro

^ f GA £™ CTG ATG G * C GTT TOA OTA TGC CAA GTC CTG CTA GGA GGC GCG 
Gly Asn Arg MET Leu MET Val Val Leu Leu Cys Gln Val Leu Leu Gly Gly Ala

477 492 507

fiS St! °f T o GT ^ ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC GAG ATT CAG 
Ser His Ala Ser Leu Ile Pro Glu Thr Gly Lys Lys Lys Val Ala Glu Ile Gln

522 537 552 567 

G?v SJf GCG o? A 5? A CGC CGC TCA GGG CAG AGC GAG CTC CTG CGG GAC TTC 

Gly His Ala Gly Gly Arg Arg Ser Gly Gln Ser His Glu Leu Leu Arg Asp Phe

r AG , °f G 2£ k CTT CTG CAG ATG *™ GGG CTG CGC CGC CGC CCG CAG CCT AGC AAG 
Glu Ala Thr Leu Leu Gln MET Phe Gly Leu Arg Arg Arg Pro Gln Pro Ser Lys

642 657 675 

Ser Sf S? A T T d CG GAC I AC ATG CGG GAT 0X7 TAC CGG CAG TCT GGG GAG 

Ser Ala Val Ile Pro Asp Tyr MET Arg Asp Leu Tyr Arg Leu Gln Ser Gly Glu

FIGURE 2B



687 702 717 732 

GAG GAG GAA GAG CAG ATC CAC AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC 
Glu Glu Glu Glu Gln Ile His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala

747 762 777 

AGC CGG GCC AAC ACC GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC 
Ser Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn Ile

792 807 822 837 

CCA GGG ACC AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC 

Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser Ile

852 867 882 897 

CCT GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG GTG 
Pro Glu Asn Glu Val Ile Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gln Val

912 927 942 

GAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT TAT GAG GTT
Asp Gln Gly Pro Asp Trp Glu Arg Gly Phe His Arg Ile Asn Ile Tyr Glu Val

957 972 987 1002 

ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC ACA CGA CTA CTG GAC
MET Lys Pro Pro Ala Glu Val Val Pro Gly His Leu Ile Thr Arg Leu Leu Asp

1017 1032 1047 

ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG GAA ACT TTT GAT GTG AGC CCT 
Thr Arg Leu Val His His Asn Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro 

1077 1092 1107

GCG GTC CTT CGC TGG ACC CGG GAG AAG CAG CCA AAC TAT GGG CTA GCC ATT GAG 
Ala Val Leu Arg Trp Thr Arg Glu Lys Gln Pro Asn Tyr Gly Leu Ala Ile Glu

1122 1137 1152 1167

S T ? £u T ?*° CTC CAT CAG ACT CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC 
Val Thr His Leu His Gln Thr Arg Thr His Gln Gly Gln His Val Arg Ile Ser

1182 1197 1212

CGA TCG TTA CCT CAA GGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC CTG GTC 
Arg Ser Leu Pro Gln Gly Ser Gly Asn Trp Ala Gln Leu Arg Pro Leu Leu Val

1227 1242 1257 1272 

ACC TTT GGC CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG 
Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys 

1287 1302 1317 

CGT AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC CGG 

Arg Ser Pro Lys His His Ser Gln Arg Ala Arg Lys Lys Asn Lys Asn Cys Arg
(293)

1332 1347 1362 1377

CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC TGG ATT GTG
Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp Ile Val

1392 1407 1422 1437

GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC TGC CCC TTT CCA CTG
Ala Pro Pro Gly Tyr Gln Ala Phe Tyr Cys His Gly Asp Cys Pro Phe Pro Leu

1452 1467 1482

GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT GTG CAG ACC CTG GTC AAT TCT
Ala Asp His Leu Asn Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser

1497 1512 1527 1542

GTC AAT TCC AGT ATC CCC AAA GCC TGT TGT GTG CCC ACT GAA CTG AGT GCC ATC
Val Asn Ser Ser Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile

1557 1572 1587

TCC ATG CTG TAC CTG GAT GAG TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG
Ser MET Leu Tyr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln Glu

1602 1617 (408) 1636 1646 1656

ATG GTA GTA GAG GGA TGT GGG TGC CGC TGAGATCAGG CAGTCCTTGA GGATAGACAG
MET Val Val Glu Gly Cys Gly Cys Arg

1666 1676 1686 1696 1706 1716 1726 

ATATACACAC CACACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 



1736 1746 1756 1766 1776 1786 1796 

ACAGACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA ATCCCTAAAC 



1806 1816 1826 1836 1846 1856 1866 

ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT TGATCATATA TTTTGACAAA 



1876 1886 1896 1906 1916 1926 1936 

ATATATTTAT AACTACGTAT TAAAAGAAAA AAATAAAATG AGTCATTATT TTAAAAAAAA AAAAAAAACT 



1946 

CTAGAGTCGA CGGAATTC

FIGURE 3A



10 20 30 40 50 

GTGACCGAGC GGCGCGGACG GCCGCCTGCC CCCTCTGCCA CCTGGGGCGG 



60 70 80 90 99 

TGCGGGCCCG GAGCCCGGAG CCCGGGTkGC GCGTAGAGCC GGCGCG ATG 

MET 
(1) 

108 117 126 135 144 

CAC GTG CGC TCA CTG CGA GCT GCG GCG CCG CAC AGC TTC GTG GCG 
His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 

153 162 171 180 189

CTC TGG GCA CCC CTG TTC CTG CTG CGC TCC GCC CTG GCC GAC TTC 
Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe 

198 207 216 225 234 

AGC CTG GAC AAC GAG GTG CAC TCG AGC TTC ATC CAC CGG CGC CTC 
Ser Leu Asp Asn Glu Val His Ser Ser Phe Ile His Arg Arg Leu

243 252 261 270 279 

CGC AGC CAG GAG CGG CGG GAG ATG CAG CGC GAG ATC CTC TCC ATT 
Arg Ser Gln Glu Arg Arg Glu MET Gln Arg Glu Ile Leu Ser Ile

288 297 306 315 324 

TTG GGC TTG CCC CAC CGC CCG CGC CCG CAC CTC CAG GGC AAG CAC 
Leu Gly Leu Pro His Arg Pro Arg Pro His Leu Gln Gly Lys His

333 342 351 360 369 

AAC TCG GCA CCC ATG TTC ATG CTG GAC CTG TAC AAC GCC ATG GCG 
Asn Ser Ala Pro MET Phe MET Leu Asp Leu Tyr Asn Ala MET Ala 

378 387 396 405 414 

GTG GAG GAG GGC GGC GGG CCC GGC GGC CAG GGC TTC TCC TAC CCC 
Val Glu Glu Gly Gly Gly Pro Gly Gly Gln Gly Phe Ser Tyr Pro

423 432 441 450 459 

TAC AAG GCC GTC TTC AGT ACC CAG GGC CCC CCT CTG GCC AGC CTG 
Tyr Lys Ala Val Phe Ser Thr Gln Gly Pro Pro Leu Ala Ser Leu

468 477 486 495 504 

CAA GAT AGC CAT TTC CTC ACC GAC GCC GAC ATG GTC ATG AGC TTC 
Gln Asp Ser His Phe Leu Thr Asp Ala Asp MET Val MET Ser Phe

513 522 531 540 549 

GTC AAC CTC GTG GAA CAT GAC AAG GAA TTC TTC CAC CCA CGC TAC
Val Asn Leu Val Glu His Asp Lys Glu Phe Phe His Pro Arg Tyr

FIGURE 3B



558 567 576 585 594 

CAC CAT CGA GAG TTC CGG TTT GAT CTT TCC AAG ATC CCA GAA GGG 
His His Arg Glu Phe Arg Phe Asp Leu Ser Lys Ile Pro Glu Gly

603 612 621 630 639 

GAA GCT GTC ACG GCA GCC GAA TTC CGG ATC TAC AAG GAC TAC ATC 
Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Asp Tyr Ile

648 657 666 675 684 

CGG GAA CGC TTC GAC AAT GAG ACG TTC CGG ATC AGC GTT TAT CAG 
Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg Ile Ser Val Tyr Gln

693 702 711 720 729 

GTG CTC CAG GAG CAC TTG GGC AGG GAA TCG GAT CTC TTC CTG CTC 
Val Leu Gln Glu His Leu Gly Arg Glu Ser Asp Leu Phe Leu Leu

738 747 756 765 774 

GAC AGC CGT ACC CTC TGG GCC TCG GAG GAG GGC TGG CTG GTG TTT 
Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu Val Phe 

783 792 801 810 819

GAC ATC ACA GCC ACC AGC AAC CAC TGG GTG GTC AAT CCG CGG CAC 
Asp Ile Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His

828 837 846 855 864 

AAC CTG GGC CTG CAG CTC TCG GTG GAG ACG CTG GAT GGG CAG AGC 
Asn Leu Gly Leu Gln Leu Ser Val Glu Thr Leu Asp Gly Gln Ser

873 882 891 900 909 

ATC AAC CCC AAG TTG GCG GGC CTG ATT GGG CGG CAC GGG CCC CAG 
Ile Asn Pro Lys Leu Ala Gly Leu Ile Gly Arg His Gly Pro Gln

918 927 936 945 954 

AAC AAG CAG CCC TTC ATG GTG GCT TTC TTC AAG GCC ACG GAG GTC 
Asn Lys Gln Pro Phe MET Val Ala Phe Phe Lys Ala Thr Glu Val

963 972 981 990 999 

CAC TTC CGC AGC ATC CGG TCC ACG GGG AGC AAA CAG CGC AGC CAG 
His Phe Arg Ser Ile Arg Ser Thr Gly Ser Lys Gln Arg Ser Gln

(293) 

1008 1017 1026 1035 1044 

AAC CGC TCC AAG ACG CCC AAG AAC CAG GAA GCC CTG CGG ATG GCC 
Asn Arg Ser Lys Thr Pro Lys Asn Gln Glu Ala Leu Arg MET Ala

1053 1062 1071 1080 1089 

AAC GTG GCA GAG AAC AGC AGC AGC GAC CAG AGG CAG GCC TGT AAG 
Asn Val Ala Glu Asn Ser Ser Ser Asp Gln Arg Gln Ala Cys Lys

FIGURE 3C



JiS Hi^l t CTG Z^*™ AGC TTc1 ^ GAC CTG^GC TGG CAG^AC 
Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gln Asp

?rp nelH K GGC ™™ gcc »Ac"2 tgt gag'ggg 

Trp Ile Ile Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly

1188 1197 1206 1215 1224

Glu 25! G ?° S° CCT 010 AAC TCC TAC ATG AAC G^C ACC AAC CAC 
Glu Cys Ala Phe Pro Leu Asn Ser Tyr MET Asn Ala Thr Asn His

GCC ^^P? CAG GTC CAc1 TTC ATC AAc'cCG GAA ACG^G 

Ala Ile Val Gln Thr Leu Val His Phe Ile Asn Pro Glu Thr Val

Pro fi G l CC TGC TGt1gCG CCC ACG "ag CTC AAt'gSc ATC TCC^C 
Pro Lys Pro Cys Cys Ala Pro Thr Gln Leu Asn Ala Ile Ser Val

1323 1332 1341 1350

fin l A ° 5° GAT GAC AGC TCC GTC ATC CTG AAG AAA TAC^Ga" 

Leu Tyr Phe Asp Asp Ser Ser Asn Val Ile Leu Lys Lys Tyr Arg

1368 1377 1386 13QQ 

ten m S5 » GG GCC TGT GGC TGC CAC TAGCTCCTCC 

Asn MET Val Val Arg Ala Cys Gly Cys His 

(431) 

gagaatJcag accctttcgg gccaagtttt tctggatcct ccattggtc 






FIGURE 4A 



10 20 30 40 50

CGACCATGAG AGATAAGGAC TGAGGGCCAG GAAGGGGAAG CGAGCCCGCC 

60 70 80 90 100 

GAGAGGTGGC GGGGACTGCT CACGCCAAGG GCCACAGCGG CCGCGCTCCG 

110 120 130 140 150

GCCTCGCTCC GCCGCTCCAC GCCTCGCGGG ATCCGCGGGG GCAGCCCGGC 

159 168 177 186 195 

CGGGCGGGG ATG CCG GGG CTG GGG CGG AGG GCG CAG TGG CTG TGC 

MET Pro Gly Leu Gly Arg Arg Ala Gln Trp Leu Cys

204 213 222 231 240 

TGG TGG TGG GGG CTG CTG TGC AGC TGC TGC GGG CCC CCG CCG CTG 
Trp Trp Trp Gly Leu Leu Cys Ser Cys Cys Gly Pro Pro Pro Leu 

249 258 267 276 285 

CGG CCG CCC TTG CCC GCT GCC GCG GCC GCC GCC GCC GGG GGG CAG 
Arg Pro Pro Leu Pro Ala Ala Ala Ala Ala Ala Ala Gly Gly Gln

294 303 312 321 330 

CTG CTG GGG GAC GGC GGG AGC CCC GGC CGC ACG GAG CAG CCG CCG 
Leu Leu Gly Asp Gly Gly Ser Pro Gly Arg Thr Glu Gln Pro Pro

339 348 357 366 375 

CCG TCG CCG CAG TCC TCC TCG GGC TTC CTG TAC CGG CGG CTC AAG 
Pro Ser Pro Gln Ser Ser Ser Gly Phe Leu Tyr Arg Arg Leu Lys

384 393 402 411 420 

ACG CAG GAG AAG CGG GAG ATG CAG AAG GAG ATC TTG TCG GTG CTG 
Thr Gln Glu Lys Arg Glu MET Gln Lys Glu Ile Leu Ser Val Leu



429 438 447 456 465 

GGG CTC CCG CAC CGG CCC CGG CCC CTG CAC GGC CTC CAA CAG CCG 
Gly Leu Pro His Arg Pro Arg Pro Leu His Gly Leu Gln Gln Pro

FIGURE 4B

474 483 492 501 510

CAG CCC CCG GCG CTC CGG CAG CAG GAG GAG CAG CAG CAG CAG CAG
Gln Pro Pro Ala Leu Arg Gln Gln Glu Glu Gln Gln Gln Gln Gln

519 528 537 546 555

CAG CTG CCT CGC GGA GAG CCC CCT CCC GGG CGA CTG AAG TCC GCG
Gln Leu Pro Arg Gly Glu Pro Pro Pro Gly Arg Leu Lys Ser Ala

564 573 582 591 600

CCC CTC TTC ATG CTG GAT CTG TAC AAC GCC CTG TCC GCC GAC AAC
Pro Leu Phe MET Leu Asp Leu Tyr Asn Ala Leu Ser Ala Asp Asn

609 618 627 636 645

GAC GAG GAC GGG GCG TCG GAG GGG GAG AGG CAG CAG TCC TGG CCC
Asp Glu Asp Gly Ala Ser Glu Gly Glu Arg Gln Gln Ser Trp Pro

654 663 672 681 690

CAC GAA GCA GCC AGC TCG TCC CAG CGT CGG CAG CCG CCC CCG GGC
His Glu Ala Ala Ser Ser Ser Gln Arg Arg Gln Pro Pro Pro Gly

699 708 717 726 735

GCC GCG CAC CCG CTC AAC CGC AAG AGC CTT CTG GCC CCC GGA TCT
Ala Ala His Pro Leu Asn Arg Lys Ser Leu Leu Ala Pro Gly Ser

744 753 762 771 780

GGC AGC GGC GGC GCG TCC CCA CTG ACC AGC GCG CAG GAC AGC GCC
Gly Ser Gly Gly Ala Ser Pro Leu Thr Ser Ala Gln Asp Ser Ala

789 798 807 816 825

TTC CTC AAC GAC GCG GAC ATG GTC ATG AGC TTT GTG AAC CTG GTG
Phe Leu Asn Asp Ala Asp MET Val MET Ser Phe Val Asn Leu Val

834 843 852 861 870

GAG TAC GAC AAG GAG TTC TCC CCT CGT CAG CGA CAC CAC AAA GAG
Glu Tyr Asp Lys Glu Phe Ser Pro Arg Gln Arg His His Lys Glu

879 888 897 906 915

TTC AAG TTC AAC TTA TCC CAG ATT CCT GAG GGT GAG GTG GTG ACG
Phe Lys Phe Asn Leu Ser Gln Ile Pro Glu Gly Glu Val Val Thr

924 933 942 951 960

GCT GCA GAA TTC CGC ATC TAC AAG GAC TGT GTT ATG GGG AGT TTT
Ala Ala Glu Phe Arg Ile Tyr Lys Asp Cys Val MET Gly Ser Phe

FIGURE 4C



969 978 987 996 1005 

AAA AAC CAA ACT TTT CTT ATC AGC ATT TAT CAA GTC TTA CAG GAG 
Lys Asn Gln Thr Phe Leu Ile Ser Ile Tyr Gln Val Leu Gln Glu

1014 1023 1032 1041 1050 

5^° AGA GAC TCT GAC CTG TTT TTG TTG GAC ACC CGT GTA 
His Gln His Arg Asp Ser Asp Leu Phe Leu Leu Asp Thr Arg Val

1059 1068 1077 1086 1095 

J 66 G fC TCA GAA GAA GGC TGG CTG GAA TTT GAC ATC ACG GCC 
Val Trp Ala Ser Glu Glu Gly Trp Leu Glu Phe Asp Ile Thr Ala

1104 1113 1122 1131 1140

ACT AGC AAT CTG TGG GTT GTG ACT CCA CAG CAT AAC ATG GGG CTT 
Thr Ser Asn Leu Trp Val Val Thr Pro Gln His Asn MET Gly Leu

1149 1158 1167 1176 1185

CAG CTG AGC GTG GTG ACA AGG GAT GGA CTC CAC GTC CAC CCC CGA
Gln Leu Ser Val Val Thr Arg Asp Gly Val His Val His Pro Arg

1194 1203 1212 1221 1230 

Sff ??? S? C ? TG GTG GGC AGA GA C GGC CCT TAC GAT AAG CAG CCC 
Ala Ala Gly Leu Val Gly Arg Asp Gly Pro Tyr Asp Lys Gln Pro

1239 1248 1257 1266 1275

*S f, TG GCT T ? C "« AAA GTG AGT GAG GTC CAC GTG CGC ACC 
Phe MET Val Ala Phe Phe Lys Val Ser Glu Val His Val Arg Thr

1284 1293 1302 1311 1320 

1°* G ? C TCC AGC CGG CGC CGA CAA CAG AGT CGT AAT CGC 
Thr Arg Ser Ala Ser Ser Arg Arg Arg Gln Gln Ser Arg Asn Arg

(382) 

1329 1338 1347 1356 1365

e 01 ACC £A G TCC CAG GAC GTG GCG CGG GTC TCC AGT GCT TCA GAT 
Ser Thr Gln Ser Gln Asp Val Ala Arg Val Ser Ser Ala Ser Asp

1374 1383 1392 1401 1410 

TAC AAC AGC AGT GAA TTG AAA ACA GCC TGC AGG AAG CAT GAG CTG 
Tyr Asn Ser Ser Glu Leu Lys Thr Ala Cys Arg Lys His Glu Leu

(412) 

1419 1428 1437 1446 1455

TAT GTG AGT TTC CAA GAC CTG GGA TGG CAG GAC TGG ATC ATT GCA 
Tyr Val Ser Phe Gln Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala

FIGURE 4D



1464 1473 1482 1491 1500 

CCC AAG GGC TAT GCT GCC AAT TAC TGT GAT GGA GAA TGC TCC TTC 
Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly Glu Cys Ser Phe 



1509 1518 1527 1536 1545 

CCA CTC AAC GGA CAC ATG AAT GCA ACC AAC CAC GCG ATT GTG CAG 
Pro Leu Asn Ala His MET Asn Ala Thr Asn His Ala Ile Val Gln



1554 1563 1572 1581 1590 

ACC TTG GTT CAC CTT ATG AAC CCC GAG TAT GTC CCC AAA CCG TGC 
Thr Leu Val His Leu MET Asn Pro Glu Tyr Val Pro Lys Pro Cys 

1599 1608 1617 1626 1635 

TGT GCG CCA ACT AAG CTA AAT GCC ATC TCG GTT CTT TAC TTT GAT 
Cys Ala Pro Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp



1644 1653 1662 1671 1680 

GAC AAC TCC AAT GTC ATT CTG AAA AAA TAC AGG AAT ATG GTT GTA 

Asp Asn Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn MET Val Val



1689 1698 1708 1718 1728 

AGA GCT TGT GGA TGC CAC TAACTCGAAA CCAGATGCTG GGGACACACA 

Arg Ala Cys Gly Cys His 

(513) 



1738 1748 1758 1768 1778

TTCTGCCTTG GATTCCTAGA TTACATCTGC CTTAAAAAAA CACGGAAGCA

1788 1798 1808 1818 1828

CAGTTGGAGG TGGGACGATG AGACTTTGAA ACTATCTCAT GCCAGTGCCT

1838 1848 1858 1868 1878

TATTACCCAG GAAGATTTTA AAGGACCTCA TTAATAATTT GCTCACTTGG

1888 1898 1908 1918 1928

TAAATGACGT GAGTAGTTGT TGGTCTGTAG CAAGCTGAGT TTGGATGTCT

1938 1948 1958 1968 1978

GTAGCATAAG GTCTGGTAAC TGCAGAAACA TAACCGTGAA GCTCTTCCTA

1988 1998 2008 2018 2028

CCCTCCTCCC CCAAAAACCC ACCAAAATTA GTTTTAGCTG TAGATCAAGC

2038 2048 2058 2068 2078

TATTTGGGGT GTTTGTTAGT AAATAGGGAA AATAATCTCA AAGGAGTTAA

2088 2098 2108 2118 2128

ATGTATTCTT GGCTAAAGGA TCAGCTGGTT CAGTACTGTC TATCAAAGGT

FIGURE 4E



2138 2148 2158 2168 2178 

AGATTTTACA GAGAACAGAA ATCGGGGAAG TGGGGGGAAC GCCTCTGTTC 

2188 2198 2208 2218 2228 

AGTTCATTCC CAGAAGTCCA CAGGACGCAC AGCCCAGGCC ACAGCCAGGG 

2238 2248 2258 2268 2278 

CTCCACGGGG CGCCCTTGTC TCAGTCATTG CTGTTGTATG TTCGTGCTGG 

2288 2298 2308 2318 2328 

AGTTTTGTTG GTGTGAAAAT ACACTTATTT CAGCCAAAAC ATACCATTTC 

2338 2348 2358 2368 2378

TACACCTCAA TCCTCCATTT GCTGTACTCT TTGCTAGTAC CAAAAGTAGA 

2388 2398 2408 2418 2428

CTGATTACAC TGAGGTGAGG CTACAAGGGG TGTGTAACCG TGTAACACGT 

2438 2448 2458 2468 2478 

GAAGGCAGTG CTCACCTCTT CTTTACCAGA ACGGTTCTTT GACCAGCACA 

2488 2498 2508 2518 2528

TTAACTTCTG GACTGCCGGC TCTAGTACCT TTTCAGTAAA GTGGTTCTCT 

2538 2548 2558 2568 2578

GCCTTTTTAC TATACAGCAT ACCACGCCAC AGGGTTAGAA CCAACGAAGA 

2588 2598 2608 2618 2628

AAATAAAATG AGGGTGCCCA GCTTATAAGA ATGGTGTTAG GGGGATGAGC 

2638 2648 2658 2668 2678

ATGCTGTTTA TGAACGGAAA TCATGATTTC CCTGTAGAAA GTGAGGCTCA 

2688 2698 2708 2718 2728 

GATTAAATTT TAGAATATTT TCTAAATGTC TTTTTCACAA TCATGTGACT 

2738 2748 2758 2768 2778 

GGGAAGGCAA TTTCATACTA AACTGATTAA ATAATACATT TATAATCTAC 



2788 2798 2808 2818 2828

AACTGTTTGC ACTTACAGCT TTTTTTGTAA ATATAAACTA TAATTTATTG

2838 2848 2858 2868 2878



TCTATTTTAT ATCTGTTTTG CTGTGGCGTT GGGGGGGGGG CCGGGCTTTT 

2888 2898 2908 2918 

GGGGGGGGGG GTTTGTTTGG GGGGTGTCGT GGTGTGGGCG GGCGG

FIGURE 5A



10 20 30 40 50 

CTGGTATATT TGTGCCTGCT GGAGGTGGAA TTAACAGTAA GAAGGAGAAA 

60 70 80 90 100 

GGGATTGAAT GGACTTACAG GAAGGATTTC AAGTAAATTC AGGGAAACAC 

110 120 130 140 150

ATTTACTTGA ATAGTACAAC CTAGAGTATT ATTTTACACT AAGACGACAC 

160 170 180 190 200 

AAAAGATGTT AAAGTTATCA CCAAGCTGCC GGACAGATAT ATATTCCAAC 

210 220 230 240 250 

ACCAAGGTGC AGATCAGCAT AGATCTGTGA TTCAGAAATC AGGATTTGTT 

260 270 280 290 300 

TTGGAAAGAG CTCA^GGGTT GAGAAGAACT CAAAAGCAAG TGAAGATTAC 

310 320 330 340 350 

TTTGGGAACT ACAGTTTATC AGAAGATCAA CTTTTGCTAA TTCAAATACC 

360 370 380 390 400 

AAAGGCCTGA TTATCATAAA TTCATATAGG AATGCATAGG TCATCTGATC 

410 420 430 440 450 

AAATAATATT AGCCGTCTTC TGCTACATCA ATGCAGCAAA AACTCTTAAC 

460 470 480 490 500 

AACTGTGGAT AATTGGAAAT CTGAGTTTCA GCTTTCTTAG AAATAACTAC 

510 520 530 540 550 

TCTTGACATA TTCCAAAATA TTTAAAATAG GACAGGAAAA TCGGTGAGGA 

560 570 580 590 600 

TGTTGTGCTC AGAAATGTCA CTGTCATGAA AAATAGGTAA ATTTGTTTTT 

610 620 630 640 650 

TCAGCTACTG GGAAACTGTA CCTCCTAGAA CCTTAGGTTT TTTTTTTTTT 

660 670 680 690 700 

AAGAGGACAA GAAGGACTAA AAATATCAAC TTTTGCTTTT GGACAAAA

FIGURE 5B

701 710 719 728 737

ATG CAT CTG ACT GTA TTT TTA CTT AAG GGT ATT GTG GGT TTC CTC
MET His Leu Thr Val Phe Leu Leu Lys Gly Ile Val Gly Phe Leu
(1)

746 755 764 773 782

TGG AGC TGC TGG GTT CTA GTG GGT TAT GCA AAA GGA GGT TTG GGA
Trp Ser Cys Trp Val Leu Val Gly Tyr Ala Lys Gly Gly Leu Gly

791 800 809 818 827

GAC AAT CAT GTT CAC TCC AGT TTT ATT TAT AGA AGA CTA CGG AAC
Asp Asn His Val His Ser Ser Phe Ile Tyr Arg Arg Leu Arg Asn

836 845 854 863 872

CAC GAA AGA CGG GAA ATA CAA AGG GAA ATT CTC TCT ATC TTG GGT
His Glu Arg Arg Glu Ile Gln Arg Glu Ile Leu Ser Ile Leu Gly

881 890 899 908 917

TTG CCT CAC AGA CCC AGA CCA TTT TCA CCT GGA AAA CAA GCG TCC
Leu Pro His Arg Pro Arg Pro Phe Ser Pro Gly Lys Gln Ala Ser

926 935 944 953 962

TCT GCA CCT CTC TTT ATG CTG GAT CTC TAC AAT GCC ATG ACC AAT
Ser Ala Pro Leu Phe MET Leu Asp Leu Tyr Asn Ala MET Thr Asn

971 980 989 998 1007

GAA GAA AAT CCT GAA GAG TCG GAG TAC TCA GTA AGG GCA TCC TTG
Glu Glu Asn Pro Glu Glu Ser Glu Tyr Ser Val Arg Ala Ser Leu

1016 1025 1034 1043 1052

GCA GAA GAG ACC AGA GGG GCA AGA AAG GGA TAC CCA GCC TCT CCC
Ala Glu Glu Thr Arg Gly Ala Arg Lys Gly Tyr Pro Ala Ser Pro

1061 1070 1079 1088 1097

AAT GGG TAT CCT CGT CGC ATA CAG TTA TCT CGG ACG ACT CCT CTG
Asn Gly Tyr Pro Arg Arg Ile Gln Leu Ser Arg Thr Thr Pro Leu

1106 1115 1124 1133 1142

ACC ACC CAG AGT CCT CCT CTA GCC AGC CTC CAT GAT ACC AAC TTT
Thr Thr Gln Ser Pro Pro Leu Ala Ser Leu His Asp Thr Asn Phe

1151 1160 1169 1178 1187

CTG AAT GAT GCT GAC ATG GTC ATG AGC TTT GTC AAC TTA GTT GAA
Leu Asn Asp Ala Asp MET Val MET Ser Phe Val Asn Leu Val Glu

1196 1205 1214 1223 1232

AGA GAC AAG GAT TTT TCT CAC CAG CGA AGG CAT TAC AAA GAA TTT
Arg Asp Lys Asp Phe Ser His Gln Arg Arg His Tyr Lys Glu Phe

FIGURE 5C



1241 1250 1259 1268 1277

CGA TTT GAT CTT ACC CAA ATT CCT CAT GGA GAG GCA GTG ACA GCA 
Arg Phe Asp Leu Thr Gln Ile Pro His Gly Glu Ala Val Thr Ala

1286 1295 1304 1313 1322 

GCT GAA TTC CGG ATA TAC AAG GAC CGG AGC AAC AAC CGA TTT GAA 
Ala Glu Phe Arg Ile Tyr Lys Asp Arg Ser Asn Asn Arg Phe Glu

1331 1340 1349 1358 1367 

AAT GAA ACA ATT AAG ATT AGC ATA TAT CAA ATC ATC AAG GAA TAC 
Asn Glu Thr Ile Lys Ile Ser Ile Tyr Gln Ile Ile Lys Glu Tyr

1376 1385 1394 1403 1412 

ACA AAT AGG GAT GCA GAT CTG TTC TTG TTA GAC ACA AGA AAG GCC 
Thr Asn Arg Asp Ala Asp Leu Phe Leu Leu Asp Thr Arg Lys Ala 

1421 1430 1439 1448 1457 

CAA GCT TTA GAT GTG GGT TGG CTT GTC TTT GAT ATC ACT GTG ACC 
Gln Ala Leu Asp Val Gly Trp Leu Val Phe Asp Ile Thr Val Thr

1466 1475 1484 1493 1502

AGC AAT CAT TGG GTG ATT AAT CCC CAG AAT AAT TTG GGC TTA CAG 
Ser Asn His Trp Val Ile Asn Pro Gln Asn Asn Leu Gly Leu Gln

1511 1520 1529 1538 1547 

CTC TGT GCA GAA ACA GGG GAT GGA CGC AGT ATC AAC GTA AAA TCT 
Leu Cys Ala Glu Thr Gly Asp Gly Arg Ser Ile Asn Val Lys Ser

1556 1565 1574 1583 1592 

GCT GGT CTT GTG GGA AGA CAG GGA CCT CAG TCA AAA CAA CCA TTC 
Ala Gly Leu Val Gly Arg Gln Gly Pro Gln Ser Lys Gln Pro Phe

1601 1610 1619 1628 1637 

ATG GTG GCC TTC TTC AAG GCG AGT GAG GTA CTT CTT CGA TCC GTG 
MET Val Ala Phe Phe Lys Ala Ser Glu Val Leu Leu Arg Ser Val 

1646 1655 1664 1673 1682 

AGA GCA GCC AAC AAA CGA AAA AAT CAA AAC CGC AAT AAA TCC AGC 
Arg Ala Ala Asn Lys Arg Lys Asn Gln Asn Arg Asn Lys Ser Ser

(329) 

1691 1700 1709 1718 1727 

TCT CAT CAG GAC TCC TCC AGA ATG TCC AGT GTT GGA GAT TAT AAC 
Ser His Gln Asp Ser Ser Arg MET Ser Ser Val Gly Asp Tyr Asn

(337)

FIGURE 5D



1736 1745 1754 1763 1772 

ACA AGT GAG CAA AAA CAA GCC TGT AAG AAG CAC GAA CTC TAT GTG 
Thr Ser Glu Gln Lys Gln Ala Cys Lys Lys His Glu Leu Tyr Val

(356) 

1781 1790 1799 1808 1817 

AGC TTC CGG GAT CTG GGA TGG CAG GAC TGG ATT ATA GCA CCA GAA 
Ser Phe Arg Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala Pro Glu



1826 1835 1844 1853 1862 

GGA TAC GCT GCA TTT TAT TGT GAT GGA GAA TGT TCT TTT CCA CTT 
Gly Tyr Ala Ala Phe Tyr Cys Asp Gly Glu Cys Ser Phe Pro Leu 

1871 1880 1889 1898 1907 

AAC GCC CAT ATG AAT GCC ACC AAC CAC GCT ATA GTT CAG ACT CTG 
Asn Ala His MET Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu

1916 1925 1934 1943 1952 

GTT CAT CTG ATG TTT CCT GAC CAC GTA CCA AAG CCT TGT TGT GCT 
Val His Leu MET Phe Pro Asp His Val Pro Lys Pro Cys Cys Ala 

1961 1970 1979 1988 1997

CCA ACC AAA TTA AAT GCC ATC TCT GTT CTG TAC TTT GAT GAC AGC 
Pro Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp Asp Ser

2006 2015 2024 2033 2042 

TCC AAT GTC ATT TTG AAA AAA TAT AGA AAT ATG GTA GTA CGC TCA 
Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn MET Val Val Arg Ser



2051 2060 2070 2080 2090 2100 

TGT GGC TGC CAC TAATATTAAA TAATATTGAT AATAACAAAA AGATCTGTAT 
Cys Gly Cys His 
(454) 

2110 2120 2130 2140 2150 

TAAGGTTTAT GGCTGCAATA AAAAGCATAC TTTCAGACAA ACAGAAAAAA AAA 




Figure 6 

(1)

GAATTCC GAG CCC CAT TGG AAG GAG TTC CGC TTT GAC CTG ACC CAG ATC CCG GCT
Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gln Ile Pro Ala

(10)

GGG GAG GCG GTC ACA GCT GCG GAG TTC CGG ATT TAC AAG GTG CCC AGC ATC CAC
Gly Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Val Pro Ser Ile His
(20) (30)

CTG CTC AAC AGG ACC CTC CAC GTC AGC ATG TTC CAG GTG GTC CAG GAG CAG TCC
Leu Leu Asn Arg Thr Leu His Val Ser Met Phe Gln Val Val Gln Glu Gln Ser

(40) (50)

AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT CTT CAG ACG CTC CGA GCT GGA GAC
Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gln Thr Leu Arg Ala Gly Asp

(60) (70)

GAG GGC TGG CTG GTG CTG GAT GTC ACA GGA GCC AGT GAC TGC TGG TTG CTG AAG
Glu Gly Trp Leu Val Leu Asp Val Thr Ala Ala Ser Asp Cys Trp Leu Leu Lys

(80)

CGT CAC AAG GAC CTG GGA CTC CGC CTC TAT GTG GAG ACT GAG GAT GGG CAC AGC
Arg His Lys Asp Leu Gly Leu Arg Leu Tyr Val Glu Thr Glu Asp Gly His Ser
(90) (100)

GTG GAT CCT GGC CTG GCC GGC CTG CTG GGT CAA CGG GCC CCA CGC TCC CAA CAG
Val Asp Pro Gly Leu Ala Gly Leu Leu Gly Gln Arg Ala Pro Arg Ser Gln Gln
(110) (120)

CCT TTC GTG GTC ACT TTC TTC AGG GCC AGT CCG AGT CCC ATC CGC ACC CCT CGG
Pro Phe Val Val Thr Phe Phe Arg Ala Ser Pro Ser Pro Ile Arg Thr Pro Arg

(130) (140)

GCA GTG AGG CCA CTG AGG AGG AGG CAG CCG AAG AAA AGC AAC GAG CTG CCG CAG
Ala Val Arg Pro Leu Arg Arg Arg Gln Pro Lys Lys Ser Asn Glu Leu Pro Gln

(150) (160)

GCC AAC CGA CTC CCA GGG ATC TTT GAT GAC GTC CAC GGC TCC CAC GGC CGG CAG
Ala Asn Arg Leu Pro Gly Ile Phe Asp Asp Val His Gly Ser His Gly Arg Gln

(170)

GTC TGC CGT CGG CAC GAG CTC TAC GTC AGC TTC CAG GAC CTT GGC TGG CTG GAC
Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gln Asp Leu Gly Trp Leu Asp
(180) (190)

TGG GTC ATC GCC CCC CAA GGC TAC TCA GCC TAT TAC TGT GAG GGG GAG TGC TCC
Trp Val Ile Ala Pro Gln Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ser
(200) (210)

TTC CCG CTG GAC TCC TGC ATG AAC GCC ACC AAC CAC GCC ATC CTG CAG TCC CTG
Phe Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala Ile Leu Gln Ser Leu

(220) (230)

Figure 6 (cont'd)



GTG CAC CTG ATG AAG CCA AAC GCA GTC CCC AAG GCG TGC TGT GCA CCC ACC AAG 
Val His Leu Met Lys Pro Asn Ala Val Pro Lys Ala Cys Cys Ala Pro Thr Lys 

(240) (250) 

CTG AGC GCC ACC TCT GTG CTC TAC TAT GAC AGC AGC AAC AAC GTC ATC CTG CGC 
Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val Ile Leu Arg

(260) 

AAG CAC CGC AAC ATG GTG GTC AAG GCC TGC GGC TGC CAC TGAGTCAGCCCGCCCAGC 
Lys His Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
(270) (280)

CCTACTGCAGCCACCCTTCTCATCTGGATCGGGCCCTGCAGAGGCAGAAAACCCTTAAATGCTGTCACAG 
CTCAAGCAGGAGTGTCAGGGGCCCTCACTCTCGGTGCCTACTTCCTGTCAGGCTTCTGGGAATTC

FIGURE 7

0AC6AAA00G CCTCOTOATA COCCTATTTT TAJAOQTTAA TOTCATOATA ATAAIOOTTT 60 
CTTAOACGTC AOOTOOCACT TIICOgCOAA ATOTCCOCOO AACCCOtATt TOITIATTW 120 
TCTAAATACA TTCAAATATO TATCCOCTCA TOAOACAATA ACOOTCATAA AWCWCAAT 180 
AATATTOAAA AAOOAAOACT ATOAOTATTC AACATTTCOC TOTOOCCCTT ATTCCCTTW 240 
WOCOOOAW WOCCWCCT CTTTTTOCTC ACCCAOAAAC 0CTCOT0AAA CIAAAAOATO 300 
CTOAAOATCA OTTCCOTOCA OOAOTOOOW ACAT06AACT OOATCTCAAC AOCOOTAMA 360 
TCCTTOAOAO TTTTCOCCCC OAAOAACOTT TTCCAATCAT OAOCACTITT AAAOTTCT6C 420 
TATOTOOCGC OOTATTATCC CGfATTOACC CCOOOCAACA OCAACTOOOT OCCCOCATAC 480 
ACTATTCTQA OAATOACTTO OTWASXACT CACeAOTOAO AQAAAACCAT CTTACOOATO 540
OOA10ACAGT AAOAOAATTA TOCAOTCCTO CCATAACCAT OAOTOATAAC ACTOOeOCCA 600 
ACTTACTTCT OACAACCATC OOAOOACOOA ACOAOCTAAC OOCTTTTTTO CACAACATOG 660 
COOATCATOT AACTCCCCTT OATCOTTOOO AACCOOAOC* OAATOAAOCC ATACCAAACO 720 
AC6A0C0TOA CACCACOAtO CCTOTACCAA TOGCAACAAC OTTOCOCAAA CIATIAACTO 780
0COAACTAOI TACICTAGCI ICCCCCCAAC AATTAATAOA OTOOATOOAC OOOOATAAAO 840
TTOCAOOACC ACTTCTOCOC TCOOCCCTtC CMCTCeCTO OTTTATTOCT GATAAATCTO 900 
OACCOGQTOA OCCTOOOTCT C6000TATCA TTOCAOCACI OOOGCCAOAT OOTAAOOCC* 960 
CCOOtATOBt AOTTATCTAC ACGACGGCOA OTCA0OCAA C TATOOATGAA CGAAATA6AC 1020
XOATCOCTOA QATAOCTOCC TCACWATTA AOCATWOTA ACTOICACAC CAAGTTTACT 1080 
CATATATACT WA0ATX0AT ITAAAACTTC AWTTTAATT TAAAAGOATC TACOT0AA6A 1140 
TCCTTTTTOA TAATCTCATO ACCAAAATCC CTTAACOTOA GTTTTCOTtC CACTOACCOT 1200 
CAOACCCCOT AOAAAAOATC AAA00ATOTT CTTOAOATCC TTTTTTtCTO OCCOTAATCT 1260 
OCTOCTWCA AACAAAAAAA OCACOOCTAO CAOOOOTGCT TTGTTTOCCO OATCAAOAOC 1320 
TAOCAACTCT WTTCOOAAO OTAACTCCC* TCAOCAOAOC OCACATACCA AATAOTQTCC 1380 
TOCIAOTOTA OCCOTAOTTA OOCCAOCACI TCAAOAACTC TOTAOCAOOO COTACATAOO 1440 
TOOerCWCT AATCCTOTTA CCAOTOOCTO CTOCCAOTOO COATAAOTOO TOtCWACCO 1500 
OOTTBOACTC AAOAO0ATA9 WACCOOATA AOOCOCASCO OTCOOOCTOA ACOCGCOOTT 1560 
COTOCAOACA OCCCAOCTTO 0A0OOAA00A OOTAOACCOA ACTOAOATAC CTACAOCOTO 1620 
ACCATTOAGA AA0C0C»CG CTTCOOOAAO 0QAGAAAO0C COACAOOTAT CCOOTAACOC 1680 
OCAOCGTCOO AACAOOAOAO CCCACOAOOC A0CPTCCA6C OOOAAACOCC TOGTATCTTT 1740 
ATAOTCCTOT OCGOITTOOC CACCTCTOAC TTOAOCCTCO ATTTTTOTGA TCCTCOTCAG 1800 
GOOOGCGOAO CCTATGOAAA AACCCCAOCA AOOCGOCOTT TTTACGCTTC CTCCCCTTTT 1860 
GCTOOCCTTT TOCXCACATO TTCTTTCCTO CCTTATCCCC TCATICTCTO OATAACOOTA 1920 
FIGURE 7 (cont'd) 

TTACOCCCTT KAOTOAGCT OATACCCCTC GCCCGA0COC AACGACCGAG CGCACOOAOT 1980 
CAOTOACCOA CCAACCOOAA OAOCOOGCAA TAOOCAAACC GCCTCTCCCC GCGOOTTCGC 2040 
COATTCATTA ATOCAOAATT CATCTCTCAC CTACCAAACA ATGCCCCCCT GGAAAAAATA 2100 
AATTCATATA AAAAACATAC AGATAACCAT CTOCOOTOAT AAATTATCTC TCCCOOTOTt 2160 
OACATAAATA CCACTOGCCC TOATACTCXO CACATCAGCA 0CAC0CACT6 ACCACCATGA 2220 
AOGTOACOCT CTTAAAAA7? AAOCCCTOAA QAAGGGCAGC AKCAAAGGA GAAOOCTTTG 2280 
OOOTGTGTOA TACCAAACOA AOCATTOGCC OTAAGTGCGA TTCOGGATTA GCTOOGAATG 2340 
TGCCAATOGC OOOGGCTTTT OGTTCAOOAC TAGAACTGCC ACACACCACC AAAGCTAACT 2400 
OACAGOAOAA TCCAOATGOA TOCACAAACA COCOGCCOOG AACGTCGOGC AGAOAAACAO 2460 
OCTCAATOOA AAGCAGCAAA TCCCCTCTTO GTTGGOGTAA GOCCAAAACC AGTTCOGAAA 2520 
OATTTTT7TA ACTATAAACO CTGATCOAAO OOTTTATOCC GAAOAGGTAA AGCCCTTCCC 2580 
CAGTAACAAA AAAACAACAO CXTAAATAAC COO0CTCTTA CACATTCCAO CCCTGAAAAA 2640 
GGGCATCAAA TTAAACCACA CCTATGCTCT ATCCATTTAT TTGCATACAT TCAATOAATT 2700 
OTTATCTAAO OAAATACTTA CATATOCAAO CTAAACATAA ACAACGTAAA CGTCTGAAAT 2760 
CTAQCTOTAA OAGACACCCT TTOTACOTGG ACTTCAGTOA OGTGGGOTGG AATGACTGGA 2820 
TTOTGGCTCC CCCGOCOTAT CACOCCTTTT ACTGCCAOOO AOAATGCCCT TTTCCTCTGG 2880 
CTOATCATCT CAACTCCACT AATCATOCCA TTGTTCAOAC GTTGGTCAAC TCTGTTAACT 2940 
CTAACATTCC TAACGCATOC TOTOTCCOCA CAOAACTCA0 TGCTATCTCG ATGCTGTACC 3000 
TTOAOOAGAA TOAAAACOTT OTATTAAAOA ACTATCAGGA CATOGTTGTO OAOCCTTOTG 3060 
GGTOTCOCTA OTACAOCAAA ATTAAA?ACA TAAATATATA TATATATATA TATTTTAGAA 3120 
AAAAQAAAAA AATCTAGAOT CCACCTOCAG TAATCOTACA OOOTACTACA AATAAAAAAG 3180 
OCAOCTCAOA TOACOTOCCT TTTTTCTTOT GAOCAGTAAO CTTOCCACTG GCCGTCGTTT 3240 
TACAACOTCO TOACTOOOAA AACCCTOGOO TTACCCAACT TAATCOCCTT OCAGCACATC 3300 
CCCCTT9O0C CAGCTGGCOT AATAOCOAA0 AGGCCOOCAC O0ATCGCCCT TCCCAACAOT 3360 
TGC6CMCGT OAATOOOOAA TOOCO0OTCA TGCOGTATT? TCTCCTTAOG CATCTGTGO0 3420 
CTATTTCACA CCGCATATAT OOTOCACTCT CAOTACAATC TGCTCTOATO CGOCATAGTT 3480 
AAOCCAOCCC COACACCO0C CAACACCCOC TOACacaccc TOAOOGCCTT OTCTCCTCCC 3540 
GOCATCCOCT TACAOACAAO CTGTOACCGT CTOCGCGAOC TOCATGTGTC AOAMTTTTC 3600 
ACCOTCATCA CCGAAACCCG COA 3623 

FIGURE 8 

W-20 ALKALINE PHOSPHATASE: BMP-2 VS. BMP-2/7 

FIGURE 9 

EFFECTS OF BMP-2 AND BMP-2/7 ON BGP SYNTHESIS BY W-20 CELLS 

X-axis: BMP (ng/ml) 

WO 93/09229 PCT/US92/09430 

FIGURE 10 

COMPARISON OF E. coli BMP-2 AND BMP-2/7: 
W-20-17 ALKALINE PHOSPHATASE 

FIGURE 11A 



10 20 30 40 50 60 70 

AGATCTTGAA AACACOCGGG OCACACACGC OGCGACCTAC AGCTCITTCT CAGCGTTGGA GTCGAGACGG 



80 90 100 110 120 130 140 

OGCCCGCAGC GCOCTGCGCG GGTCAGGTCC GCGCAGCTCC TCGGGAAGAG COCACCTGTC AGGCIGCGCT 



150 160 170 180 190 200 210 

GGGTCAGCGC AGCAAGTCGG GCIQGOOGCT ATCTCGCTCC A000GG0O3C GTCOOGGGCT OOGTGOGOQC 



220 230 240 250 260 270 280 

TCGCCCCAGC TGGTTTGGAG TTCAAOCCTC GGCTCCGCCG OCGGCTCCTT GOGOCITOGG AGTGTCCCGC 



290 300 310 320 (1) 335 

AGCGACGCCG GGAGCOGAOG OGOOGOGOGG CTAOCTAGOC ATC GCT GGG GOG AGC AGG CIG CTC 

MET Ala Gly Ala Ser Arg Leu Leu 

350 365 380 395 

TTT CIG TCG CTC GGC TCC TIC TCC GTC AGC CIG GOG CRG GGA GAG AGA COG AAG CCA 
Phe Leu Trp Leu Gly Cys Phe Cys Val Ser Leu Ala Gln Gly Glu Arg Pro Lys Pro 

410 425 440 455 

OCT TIC COG GAG CTC OGC AAA GCT GIG CCA GGT GAC CGC AOG GCA GGT GGT GGC COG 
Pro Phe Pro Glu Leu Arg Lys Ala Val Pro Gly Asp Arg Thr Ala Gly Gly Gly Pro 

470 485 500 515 

GAC TCC GAG CTG CAG COG CAA GAC AAG GTC TCT GAA CAC ATC CTC CGG CTC TAT GAC 
Asp Ser Glu Leu Gln Pro Gln Asp Lys Val Ser Glu His MET Leu Arg Leu Tyr Asp 

530 545 560 

AGG TAC AGC ACG GTC CAG GOG GGC CGG ACA COG GGC TCC CTG GAG GGA GGC TOG CAG 
Arg Tyr Ser Thr Val Gln Ala Ala Arg Thr Pro Gly Ser Leu Glu Gly Gly Ser Gln 

575 590 605 620 

CCC TCG CGC OCT CGG CTC CTG CGC GAA GGC AAC ACG GIT OGC AGC TTT CGG GOG GCA 

Pro Trp Arg Pro Arg Leu Leu Arg Glu Gly Asn Thr Val Arg Ser Phe Arg Ala Ala 

635 650 665 680 

GCA GCA GAA ACT CTT GAA AGA AAA GGA CTC TAT ATC TIC AAT CTG ACA TCG CTA ACC 
Ala Ala Glu Thr Leu Glu Arg Lys Gly Leu Tyr Ile Phe Asn Leu Thr Ser Leu Thr 

695 710 725 740 

AAG TCT GAA AAC ATT TIG TCT GCC ACA CTG TAT TTC TCT ATT GGA GAG CTA GGA AAC 
Lys Ser Glu Asn Ile Leu Ser Ala Thr Leu Tyr Phe Cys Ile Gly Glu Leu Gly Asn 

FIGURE 11C 



1430 1445 (377) 1460 1475 

TGC GOC AGG AGA TAC CTC AAG GTA GAC TIT GCA GAT ATT GGC TOG AGT GAA TOG ATT 
Cys Ala Arg Arg Tyr Leu Lys Val Asp Phe Ala Asp Ile Gly Trp Ser Glu Trp Ile 

1490 1505 1520 1535 

ATC TCC OCC AAG TOC TIT GAT GOC TAT TAT IGC TCT GGA GCA TGC CAG TTC CCC ATG 
Ile Ser Pro Lys Ser Phe Asp Ala Tyr Tyr Cys Ser Gly Ala Cys Gln Phe Pro MET 

1550 1565 1580 1595 

OCA AAG TCT TIG AAG OCA TCA AAT CAT GCT AOC ATC CAG AGT ATA GTG AGA GOT GIG 
Pro Lys Ser Leu Lys Pro Ser Asn His Ala Thr Ile Gln Ser Ile Val Arg Ala Val 

1610 1625 1640 1655 

GGG GIC GIT OCT GGG ATT OCT GAG OCT TOC TCT GTA CCA GAA AAG ATG TOC TCA CTC 
Gly Val Val Pro Gly Ile Pro Glu Pro Cys Cys Val Pro Glu Lys MET Ser Ser Leu 

1670 1685 1700 

AGT AIT TTA TIC ITT GAT GAA AAT AAG AAT GTA GIG CUT AAA GTA TAC OCT AAC ATG 
Ser Ile Leu Phe Phe Asp Glu Asn Lys Asn Val Val Leu Lys Val Tyr Pro Asn MET 

L 3if (472) 1746 1756 1766 1776 

ACA GTA GAG TCT TGC GCT TCC AGA TAAOCTGGCA AAGAACTCAT TTCAATGCTT AATTCAATCT 
Thr Val Glu Ser Cys Ala Cys Arg 

1786 

CZAGAGTOGA CGGAATTC 

FIGURE 12 

W-20 ALKALINE PHOSPHATASE: CHO BMP-?/6 vs. CHO BMP-? 

FIGURE 13A 

FIGURE 13B 